刮使用HTML敏捷包 [英] Scraping using Html Agility Package

查看：140 发布时间：2016/6/15 21:24:18 html asp.net xpath html-agility-pack

本文介绍了刮使用HTML敏捷包的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我试图使用HtmlAgilityPackage新闻文章链接如下：<一刮数据href=\"http://www.ndtv.com/india-news/vyapam-scam-documents-show-chief-minister-shivraj-chouhan-delayed-probe-780528\" rel=\"nofollow\">http://www.ndtv.com/india-news/vyapam-scam-documents-show-chief-minister-shivraj-chouhan-delayed-probe-780528

I am trying to scrape data from a news article using HtmlAgilityPackage the link is as follows http://www.ndtv.com/india-news/vyapam-scam-documents-show-chief-minister-shivraj-chouhan-delayed-probe-780528

我写了下面的code以下，以提取该文章，但由于某种原因，我的变量aTags正在返回空值的所有注释

I have written the following code below to extract all the comments in this articles but for some reason my variable aTags is returning null value

code：

var getHtmlWeb = new HtmlWeb();
        var document = getHtmlWeb.Load(txtinputurl.Text);
        var aTags =    document.DocumentNode.SelectNodes("//div[@class='com_user_text']");
        int counter = 1;
        if (aTags != null)
        {
            foreach (var aTag in aTags)
            {
                lbloutput.Text += lbloutput.Text + ". " + aTag.InnerHtml + "\t" + "<br />";
                counter++;
            }
        }

我也用这个XPath，但仍是同样的结果// DIV [@类='newcomment_list'] / UL /李/ DIV [@类='headerwrap'] / DIV [@类='com_user_text']
请帮我用正确的XPath来提取所有评论
找遍了网，但没有办法了。

I have also used this XPath but still the same result //div[@class='newcomment_list']/ul/li/div[@class='headerwrap']/div[@class='com_user_text'] Please help me with the correct Xpath to Extract all the comments Searched all over the net but no solution.

刮使用HTML敏捷包 [英] Scraping using Html Agility Package

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录关闭

刮使用HTML敏捷包 [英] Scraping using Html Agility Package

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录 关闭

登录关闭