如何计算在Java中字 [英] How to count words in java

查看：116 发布时间：2015/11/30 13:53:50 java algorithm

本文介绍了如何计算在Java中字的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我在寻找一种算法，暗示或任何来源$ C $ C，可以解决我下面的问题。

I am looking for an algorithm, hint or any source code that can solve my following problem.

我有它包含很多文本文件的文件夹。我阅读，并保存在字符串中的所有文本。现在我要来计算，如果有这个词出现在其他文件或没有。（我知道它并不清楚，让我举一个例子）

I have a folder it contains many text files. I read them and store all text in STRING. Now I want to to calculate, if any of the word appeared in other files or no. ( I know its not clear let me give an example)

例如我有两个文件：文档A =>棕色狐狸跳文档B =>狗不跳文档C =>狐狸跳狗

For example i have two documents: Doc A => "brown fox jump" Doc B => "dog not jump" Doc C = > "fox jump dog"

比方说我的程序读取的第一个文档，现在第一个字是棕色现在我的程序会检查是否这个词也出现在任何其他文件？因此，答案是0。现在，它会再次检查第2个字狐狸，它会给输出是它出现在（文件C）等等...... 现在，它会读取文档B和它会检查，如果狗狗出现在其他文件？答案应该是（文件C）等....

Lets say my program read the first document and now first word is "brown" now my program will check if this word is also appeared in any other document? So the answer would be 0. Now it will check again for 2nd word "fox", it will give output that yes it appeared in (Doc C) so on...... Now it will read Doc B and it will check if dog appeared in other document? Answer would be (Doc C) so on....

任何意见或伪code？

Any advice or pseudo code?

提示：它也被称为逆文档频率（IDF）。我知道什么是IDF。

Hint: It is also called inverse document frequency ( Idf ). I know what is idf.

如何计算在Java中字 [英] How to count words in java

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

如何计算在Java中字 [英] How to count words in java

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭