在Azure搜索中的数据源中添加新数据时,从数据源同步搜索索引 [英] sync search index from data source when ever new data is added in data source in Azure search

查看:84
本文介绍了在Azure搜索中的数据源中添加新数据时,从数据源同步搜索索引的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述


大家好,

Hi All,

 可以任何一个 请建议在数据源中添加新数据时,从数据源同步搜索索引的最佳方法是什么。安排程序或以编程方式更新 搜索索引。

 Can any one  please suggest what would be best approach to sync search index from data source when ever new data is added in data source. Schedule program or programmatically to update  search index.

如果数据源发生频繁变化并重新索引搜索索引,是否有任何影响。

Is there any impact if frequent changes in occur in data source and re-indexing the search index.

谢谢

坚持。

推荐答案

嗨Asif,

这听起来像当数据添加到Cosmos DB时,您希望在Azure搜索中更多地实时更新内容。您应该考虑为此目的利用CosmosDB更改源。您可以将其链接到Azure功能,以便Cosmos DB中的更改触发
一个函数,该函数将使用此搜索更改并使用Azure Search PUSH api将其应用于Azure搜索。 通常,Azure搜索索引器不适合需要更多内容实时更新的情况,因为它是预定的
流程。

This sounds like you want more of a real time update of content in Azure Search as data is added to Cosmos DB. You should consider leveraging the CosmosDB change feed for this purpose. You can link this to Azure Functions so that changes in Cosmos DB triggers a function which would take this feed of changes and applies it to Azure Search using the Azure Search PUSH api.  In general, the Azure Search indexer is not well suited for cases where you need to have more real time updates of content as it is a scheduled process.

关于您的具体情况问题是,如果发生了很多变化,由于所有资源都是共享的,因此对Azure搜索的索引会对搜索查询的性能产生影响,但Azure搜索可以非常有效地索引大量内容
所以它可能不是那么大的问题。 因此,在Azure搜索索引器中利用更改跟踪机制非常重要,这样它才会请求已更改的数据,而不是获取所有数据。 
对此类问题的唯一真正答案是测试工作负载以确保性能满足其需求并调整搜索服务层以满足要求。 

As to your specific question, if there are a lot of changes happening, the indexing into Azure Search will have an impact on the performance of the search queries since all of the resources are shared, however Azure Search can index large amounts of content pretty efficiently so it may not be that much of an issue.  For this reason, it is important to leverage a change tracking mechanism in Azure Search indexer, so that it only requests the data that has been changed, rather than getting all of the data.  The only real answer to perf questions like this is to test the workload to make sure the performance meets their needs and adjust the search service tier to meet the requirements. 





这篇关于在Azure搜索中的数据源中添加新数据时,从数据源同步搜索索引的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆