Using FILTER in a Nested FOREACH in Pig

Views: 26  Published: 2021/11/12 4:02:51  Tag: apache-pig


Problem description

I have two Pig relations. The first, count_pairs, holds pairs of words and how many times each pair was seen, e.g. ((car,tire), 4). The second, word_counts, tracks how many times each word was seen, e.g. (car, 20). For each pair, I would like to find the ratio of the number of times the pair was seen to the number of times its first word was seen. In this example I would want ((car,tire), 4/20). I tried to write a nested FOREACH to solve this problem:

> percent_count_pairs = FOREACH count_pairs {
> denom = FILTER word_counts BY ($0 ==count_pairs.pair.word1);
> GENERATE pair, count2/(double)denom.$1;}

I keep getting this error:

'Pig script failed to parse: 
<file src/cluster.pig, line 27, column 15> expression is not a project expression: (Name: ScalarExpression) Type: null Uid: null)'

This points to the line with the FILTER; googling this error did not lead me to anything helpful. Please help! (P.S. The script does work if I take the line with FILTER out of the FOREACH...)
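
A likely cause, offered as a hedged reading rather than a confirmed diagnosis: inside a nested FOREACH, operators such as FILTER apply to bags that are fields of the current tuple, not to a separate relation like word_counts. One common workaround is to join the two relations first and then divide. The sketch below assumes count_pairs has the fields pair (a tuple containing word1) and count2, as used in the question, and that word_counts has the word in $0 and its count in $1. The aliases keyed and joined and the field names word1 and fraction are hypothetical names introduced for illustration.

```pig
-- Assumption: count_pairs is (pair: (word1, word2), count2)
-- Assumption: word_counts is ($0 = word, $1 = number of times the word was seen)

-- Expose the first word of each pair as a top-level field to join on.
keyed = FOREACH count_pairs GENERATE pair, count2, pair.word1 AS word1;

-- Attach the first word's total count to every pair.
joined = JOIN keyed BY word1, word_counts BY $0;

-- After the join the columns are:
--   $0 pair, $1 count2, $2 word1, $3 word, $4 word count
-- Cast the denominator to double so the division is not done on integers.
percent_count_pairs = FOREACH joined GENERATE $0 AS pair, $1 / (double) $4 AS fraction;
```

With the example rows from the question, ((car,tire), 4) would be matched with (car, 20), so the generated fraction would be 4 divided by 20. Pairs whose first word does not appear in word_counts are dropped by this inner join.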
