How to use the MarkLogic database for real-time processing of data

Question

I am trying to evaluate MarkLogic for real-time processing of data. Previously I used Kafka and Storm to handle data in real time and inserted it into a database after processing. I am new to MarkLogic, so can anybody tell me whether MarkLogic offers anything I can use to receive data in real time, process it, and then insert it into a MarkLogic database?

Recommended answer

MarkLogic is extremely scalable and has features such as triggers, Alerting, and CPF (the Content Processing Framework), on which you can build the logic that decides what to do with incoming content. A few notes to get you started:

MarkLogic is a shared-nothing architecture, so the CPU and HTTP servers on each node are independent. Keep that in mind when you consider how to balance incoming messages across nodes.
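
Because each node runs its own HTTP server, one simple way to spread incoming messages is to rotate requests across the hosts on the client side. The following is only a minimal sketch. It assumes three hypothetical host names, a REST API app server on port 8000, and the `/v1/documents` endpoint of the MarkLogic REST Client API. A real deployment would more likely put a load balancer in front of the cluster.

```python
from itertools import cycle
from urllib.parse import urlencode

# Hypothetical host names; replace with the nodes of your own cluster.
HOSTS = ["ml-node1.example.com", "ml-node2.example.com", "ml-node3.example.com"]
PORT = 8000  # assumed port of a REST API app server


def make_url_picker(hosts, port=PORT):
    """Return a function that builds document URLs, rotating across hosts."""
    rotation = cycle(hosts)

    def next_url(doc_uri):
        host = next(rotation)  # round-robin: each call moves to the next node
        query = urlencode({"uri": doc_uri})
        return f"http://{host}:{port}/v1/documents?{query}"

    return next_url


next_url = make_url_picker(HOSTS)
urls = [next_url(f"/messages/{i}.json") for i in range(4)]
for url in urls:
    print(url)
```

With three hosts, the fourth request wraps around to the first node again, so no single HTTP server receives all of the traffic.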

MarkLogic also does not stream to disk.

MarkLogic can connect over HTTP using a great HTTP client, but I do not believe there is any out-of-the-box capability to append content to an open connection (I believe this is related to why it also has no FTP capability).
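
Since content cannot be appended to an open connection, each message would typically be delivered as its own self-contained HTTP request. Here is a minimal sketch of building such a request with the Python standard library. The base URL, document URI, and message fields are hypothetical, and the `/v1/documents` endpoint assumes a REST API app server is configured. Authentication, which an app server will normally require, is left out, and the request is built but not sent.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_insert_request(base_url, doc_uri, message):
    """Build (but do not send) a PUT request that writes one JSON document."""
    body = json.dumps(message).encode("utf-8")
    query = urlencode({"uri": doc_uri})
    return Request(
        f"{base_url}/v1/documents?{query}",
        data=body,
        method="PUT",
        headers={"Content-Type": "application/json"},
    )


req = build_insert_request(
    "http://localhost:8000",       # hypothetical REST app server
    "/messages/order-1001.json",   # hypothetical document URI
    {"type": "order", "id": 1001},
)
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it. In practice you would add an authentication handler first.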

I point these items out so that you understand you are dealing with a different type of system, so the approach is not the same. In fact, with pre-commit triggers or just an HTTP-based application, combined with very fast features like reverse queries and a design that matches how MarkLogic works, MarkLogic can be a great solution for handling huge amounts of data in real time. On one large implementation that I worked on, MarkLogic happily receives and processes large volumes of messages from an upstream WebSphere message broker. Some messages are handled internally and others are passed on to Splunk and other systems.
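
The idea behind a reverse query is to store the queries and ask which of them match each incoming document, rather than storing documents and running a query over them. The following is a conceptual illustration in plain Python, not MarkLogic's API (in MarkLogic itself this is done with stored queries and `cts:reverse-query`). The rule names, message fields, and routing targets are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class StoredRule:
    """A saved query plus the destination for messages that match it."""
    name: str
    matches: Callable[[Dict[str, Any]], bool]
    target: str


# Hypothetical rules: errors go to Splunk, large orders are handled internally.
RULES = [
    StoredRule("errors", lambda m: m.get("level") == "error", "splunk"),
    StoredRule(
        "large-orders",
        lambda m: m.get("type") == "order" and m.get("amount", 0) > 1000,
        "internal",
    ),
]


def route(message: Dict[str, Any], rules: List[StoredRule] = RULES) -> List[str]:
    """Return the targets of every stored rule that the message satisfies."""
    return [rule.target for rule in rules if rule.matches(message)]


print(route({"level": "error", "text": "disk full"}))
print(route({"type": "order", "amount": 5000}))
print(route({"type": "order", "amount": 10}))
```

This sketch checks every rule in a linear scan. MarkLogic resolves reverse queries through its indexes, which is why the answer describes them as very fast.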

I answered your question at a high level because it does not really ask a detailed question, and MarkLogic is a large, robust product that you really need to get an overview of on your own. If you have the time, there is a free 1-day training course that covers the fundamentals, which will let you better understand the product and assess it for your needs.

BTW: all training for MarkLogic is free. Here is the link to the fundamentals course: http://www.marklogic.com/training-courses/marklogic-fundamentals/ This one can also be taken on your own time (self-paced).
