如何抓取网站和获取网站链接 [英] How to crawl websites and fetch link of websites
本文介绍了如何抓取网站和获取网站链接的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!
问题描述
如何抓取网站链接并在我们自己的网站上显示
1:网站内容链接应与财务相关
how to crawl websites link and show on our own websites
1:Content of websites link should be related to finance
推荐答案
你不会在几篇论坛帖子中得到那种解释。有太多的信息可以过去。
Google用于文章和示例的C#网络爬虫。
You're not going to get that kind of explanation in a couple of forum posts. There's simply too much information to go over.
Google for "C# web crawler" for articles and examples.
非常基本,它需要一些 Web抓取的技术: http://en.wikipedia.org/wiki/Web_scraping [ ^ ]。
请查看我过去的答案:
从网页获取特定数据 [ ^ ],
执行某种Web请求并获得结果 [ ^ ]。
-SA
Very basically, it requires some techniques of Web scraping: http://en.wikipedia.org/wiki/Web_scraping[^].
Please see my past answers:
get specific data from web page[^],
Performing a some kind of Web Request and getting result[^].
—SA
如果您要自己编写代码,您可能需要查看HTML Agaility Pack http://htmlagilitypack.codeplex.com/ [ ^ ]用于解析HTML并使用HttpWebRequest或类似内容来获取页面内容。
If you're going to code it yourself, you'll probably want to take a look at the HTML Agaility Pack http://htmlagilitypack.codeplex.com/[^] for parsing the HTML and using HttpWebRequest's or something similar to get the page content.
这篇关于如何抓取网站和获取网站链接的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!
查看全文