首页
其他开发
Gremlin：GroupBy顶点，计数> 1

Gremlin：GroupBy顶点，计数> 1 [英] Gremlin : GroupBy vertices , having count > 1

查看：508 发布时间：2018/5/25 17:43:16 graph graph-databases titan gremlin

本文介绍了Gremlin：GroupBy顶点，计数> 1的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我使用TITAN 0.4和gremlin进行遍历。
我的要求是在图中标识重复的顶点，并合并它们。
图中有> 15 M个顶点。

  gremlin> gVhas（'domain'）。groupBy {it.domain} {it.id} .cap 
 
 ==> {google.com = [4]，yahoo.com = [16，24 ，20]}

我能够对顶点进行分组，但我只需要那些域（顶点）它不止一次存在。

在上面的例子中，我只需要返回==>> {yahoo.com = [16，24，20] $ b考虑使用 groupCount 而不是 groupBy

解决方案以保存在收集列表中计算ID的步骤： 
 
 
  gVhas（'domain'）。groupCount（it.domain } .cap.next（）。findAll {it.value> 1} 
  
我猜这样比较便宜以及更大的遍历，因为你只是维护一个计数器而不是标识符列表。
 
I am using TITAN 0.4, and gremlin for traversals.
My requirement is to identify duplicate vertices in graph, and to merge those.
There are > 15 M vertices in graph.
gremlin> g.V.has('domain').groupBy{it.domain}{it.id}.cap

==>{google.com=[4], yahoo.com=[16, 24, 20]}
I am able to group the vertices, but I need only those domains(vertices) which exists more than once. 

In the above example, I need to return only ==>{yahoo.com=[16, 24, 20]}
The key "domain" is indexed, if that makes any difference.

Please help me here
 解决方案 
Consider use of groupCount rather than groupBy to save a step of counting up ids in your collected list:
g.V.has('domain').groupCount(it.domain}.cap.next().findAll{it.value>1}
I suppose this is cheaper as well on a larger traversal as you are just maintaining a counter rather than lists of identifiers.

                        这篇关于Gremlin：GroupBy顶点，计数&gt; 1的文章就介绍到这了，希望我们推荐的答案对大家有所帮助，也希望大家多多支持IT屋！


                    
                        查看全文


        
            



        
        
            相关文章
            
                    
                        
                            Gremlin删除多个顶点;
                        
                    
                    
                        
                            Gremlin 删除所有顶点;
                        
                    
                    
                        
                            Gremlin删除所有顶点;
                        
                    
                    
                        
                            Gremlin:如何返回顶点及其关联的顶点?;
                        
                    
                    
                        
                            Tinkerpop/Gremlin合并顶点(和边);
                        
                    
                    
                        
                            Gremlin查询以覆盖顶点;
                        
                    
                    
                        
                            Gremlin - 如果顶点不存在，则仅添加顶点;
                        
                    
                    
                        
                            Gremlin，如何在gremlin-python中为现有顶点添加边;
                        
                    
                    
                        
                            Gremlin过滤器计数;
                        
                    
                    
                        
                            在1个查询gremlin中创建不存在的顶点和边缘;
                        
                    
                    
                        
                            GROUPBY以计数;
                        
                    
                    
                        
                            如何在Gremlin上更新几个顶点属性?;
                        
                    
                    
                        
                            OrientDB Gremlin-检索gremlin中的类的顶点但未达到索引;
                        
                    
                    
                        
                            如何通过Java访问Gremlin中路径的顶点?;
                        
                    
                    
                        
                            Gremlin-如果超出限制，则添加新顶点;
                        
                    
                    
                        
                            在本地dynamodb的gremlin图中创建多个顶点;
                        
                    
                    
                        
                            Linq .GroupBy()与计数;
                        
                    
                    
                        
                            在java中的gremlin titan中过滤出顶点数量上的顶点;
                        
                    
                    
                        
                            如何删除gremlin中没有边的不可修改顶点?;
                        
                    
                    
                        
                            LINQ 与 groupby 和计数;
                        
                    
                    
                        
                            LINQ与GROUPBY和计数;
                        
                    
                    
                        
                            熊猫groupby与bin计数;
                        
                    
                    
                        
                            Gremlin-Server添加具有多个属性的顶点（Titan 1.0.0）;
                        
                    
                    
                        
                            Gremlin在Azure CosmosDB上:如何投影相关顶点的属性?;
                        
                    
                    
                        
                            Gremlin-Server 添加具有多个属性的顶点 (Titan 1.0.0);


    
        
            其他开发最新文章
            
                    
                        
                            拒绝显示一个框架，因为它将'X-Frame-Options'设置为'sameorigin';
                        
                    
                    
                        
                            什么是＆QUOT; AW＆QUOT;在部分标志属性是什么意思？;
                        
                    
                    
                        
                            在运行npm install命令时获取'npm WARN弃用'警告;
                        
                    
                    
                        
                            cmake无法找到openssl;
                        
                    
                    
                        
                            从Spark的scala中的* .tar.gz压缩文件中读取HDF5文件;
                        
                    
                    
                        
                            Twitter :: Error :: Forbidden  - 无法验证您的凭据;
                        
                    
                    
                        
                            我什么时候需要一个fb：app_id或者fb：admins？;
                        
                    
                    
                        
                            将.db文件导入R;
                        
                    
                    
                        
                            npm通知创建一个lockfile作为package-lock.json。你应该提交这个文件;
                        
                    
                    
                        
                            拒绝执行内联脚本，因为它违反了以下内容安全策略指令：“script-src'self'”;
                        
                    
            
        
        
            
                热门教程
            
            
                
                    
                        Java教程
                    
                
                
                    
                        Apache ANT 教程
                    
                
                
                    
                        Kali Linux教程
                    
                
                
                    
                        JavaScript教程
                    
                
                
                    
                        JavaFx教程
                    
                
                
                    
                        MFC 教程
                    
                
                
                    
                        Apache HTTP客户端教程
                    
                
                
                    
                        Microsoft Visio 教程
                    
                
            
        
        
            
                热门工具
            
            
                
                
                    
                        Java 在线工具
                    
                
                
                    
                        C(GCC) 在线工具
                    
                
                
                    
                        PHP 在线工具
                    
                
                
                    
                        C# 在线工具
                    
                
                
                    
                        Python 在线工具
                    
                
                
                    
                        MySQL 在线工具
                    
                
                
                    
                        VB.NET 在线工具
                    
                
                
                    
                        Lua 在线工具
                    
                
                
                    
                        Oracle 在线工具
                    
                
                
                    
                        C++(GCC) 在线工具
                    
                
                
                    
                        Go 在线工具
                    
                
                
                    
                        Fortran 在线工具



    
        
            登录
            关闭
        
        
            
                扫码关注1秒登录
            
            
                
            
            
                
                
            
            
                发送“验证码”获取
                |
                15天全站免登陆
            
            
        
    
    





    
		
			友情链接：
            IT屋
            Chrome插件
            谷歌浏览器插件
        
        
            IT屋
            ©2016-2022 琼ICP备2021000895号-1
            站点地图
            站点标签
            SiteMap
            <免责申明>
            本站内容来源互联网,如果侵犯您的权益请联系我们删除.