查找前 10 个搜索词的算法 [英] Algorithm to find top 10 search terms

查看：16 发布时间：2021/12/22 8:25:16 algorithm data-structures

本文介绍了查找前 10 个搜索词的算法的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我目前正在准备面试，这让我想起了我在之前的面试中被问到的一个问题，内容如下:

I'm currently preparing for an interview, and it reminded me of a question I was once asked in a previous interview that went something like this:

您被要求设计一些软件，以连续显示 Google 上的前 10 个搜索词.您可以访问一个提要，该提要提供当前在 Google 上搜索的搜索词的无限实时流.请描述什么算法和用于实现此目的的数据结构.您将设计两种变体:

"You have been asked to design some software to continuously display the top 10 search terms on Google. You are given access to a feed that provides an endless real-time stream of search terms currently being searched on Google. Describe what algorithm and data structures you would use to implement this. You are to design two variations:

(i) 显示所有时间的前 10 个搜索词(即自您开始阅读提要以来).

(i) Display the top 10 search terms of all time (i.e. since you started reading the feed).

(ii) 仅显示过去一个月的前 10 个搜索词，每小时更新一次.

(ii) Display only the top 10 search terms for the past month, updated hourly.

您可以使用近似值来获得前 10 名的列表，但您必须证明您的选择是合理的."
我在这次采访中轰炸了，但仍然不知道如何实现这一点.

You can use an approximation to obtain the top 10 list, but you must justify your choices."
I bombed in this interview and still have really no idea how to implement this.

第一部分要求在无限列表的不断增长的子序列中出现 10 个最频繁的项目.我研究了选择算法，但找不到任何在线版本来解决这个问题.

The first part asks for the 10 most frequent items in a continuously growing sub-sequence of an infinite list. I looked into selection algorithms, but couldn't find any online versions to solve this problem.

第二部分使用了一个有限列表，但由于处理的数据量很大，你无法真正将整个月的搜索词存储在内存中并每小时计算一个直方图.

The second part uses a finite list, but due to the large amount of data being processed, you can't really store the whole month of search terms in memory and calculate a histogram every hour.

由于前 10 名列表不断更新，所以问题变得更加困难，因此您需要以某种方式通过滑动窗口计算前 10 名.

The problem is made more difficult by the fact that the top 10 list is being continuously updated, so somehow you need to be calculating your top 10 over a sliding window.

有什么想法吗?

查找前 10 个搜索词的算法 [英] Algorithm to find top 10 search terms

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

查找前 10 个搜索词的算法 [英] Algorithm to find top 10 search terms

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭