Extract a specific HTML string from a website's HTML source code in VB.NET


Question

I have the full HTML source code of a website, and I want to extract the data inside a specific div tag. Here is my code:

' Requires: Imports System.IO, System.Net, System.Text.RegularExpressions
Dim html As String = ""
Dim request As WebRequest = WebRequest.Create("https://www.crowdsurge.com/store/index.php?storeid=1056&menu=detail&eventid=41815")
Using response As WebResponse = request.GetResponse()
    Using reader As New StreamReader(response.GetResponseStream())
        ' Read the whole response body into a string
        html = reader.ReadToEnd()
    End Using
End Using

' Try to capture whatever follows the opening div tag
Dim pattern1 As String = "<div class = ""ei_value ei_date"">(.*)"
Dim m As Match = Regex.Match(html, pattern1)
If m.Success Then
    MsgBox(m.Groups(1).Value)
End If


Recommended answer

An easier approach for parsing HTML (especially from a source that you don't control) is to use the HTML Agility Pack, which lets you do something like this:

' Requires the HtmlAgilityPack package and: Imports System.Net, HtmlAgilityPack
Dim req As WebRequest = WebRequest.Create("https://www.crowdsurge.com/store/index.php?storeid=1056&menu=detail&eventid=41815")
Dim doc As New HtmlDocument()
Using res As WebResponse = req.GetResponse()
    ' Parse the response stream directly into a document
    doc.Load(res.GetResponseStream())
End Using

' SelectNodes returns Nothing when no node matches the XPath
Dim nodes = doc.DocumentNode.SelectNodes("//div[@class='ei_value ei_date']")
If nodes IsNot Nothing Then
    For Each node In nodes
        MsgBox(node.InnerText)
    Next
End If

(I'm assuming Option Infer is on.)
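
Since the question already has the page source in a string, the same XPath lookup can probably be run on that string without a second request. The following is only a minimal sketch under some assumptions: the HtmlAgilityPack package is referenced, the module and function names (`DivExtractor`, `ExtractDateValues`) are hypothetical, and the sample markup in `Main` is made up for illustration and not taken from the real page.

```vb
Imports System
Imports System.Collections.Generic
Imports HtmlAgilityPack

Module DivExtractor
    ' Hypothetical helper: returns the trimmed inner text of every <div>
    ' whose class attribute is exactly "ei_value ei_date".
    ' Returns an empty list when nothing matches.
    Function ExtractDateValues(html As String) As List(Of String)
        Dim results As New List(Of String)()
        Dim doc As New HtmlDocument()
        doc.LoadHtml(html) ' parse from a string instead of a stream

        ' SelectNodes returns Nothing (not an empty collection) when no node matches.
        Dim nodes As HtmlNodeCollection =
            doc.DocumentNode.SelectNodes("//div[@class='ei_value ei_date']")
        If nodes IsNot Nothing Then
            For Each node As HtmlNode In nodes
                results.Add(node.InnerText.Trim())
            Next
        End If
        Return results
    End Function

    Sub Main()
        ' Made-up sample markup, used only to exercise the helper.
        Dim sample As String =
            "<html><body><div class=""ei_value ei_date"">Sat 1 Jan</div></body></html>"
        For Each value As String In ExtractDateValues(sample)
            Console.WriteLine(value)
        Next
    End Sub
End Module
```

The explicit `As HtmlNode` and `As String` types mean this version does not depend on Option Infer. Note that the XPath compares the whole class attribute, so a div with extra classes or a different class order would not match.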
