使用Jsoup解析HTML时出错 [英] Error when parsing Html using Jsoup

查看:149
本文介绍了使用Jsoup解析HTML时出错的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我要解析HTML网站,并获得一个字符串值。但我分析的div类时收到错误。

 < D​​IV CLASS =内容明确>

我上面code写的,但我收到的错误。

  {尝试
        DOC = Jsoup.connect(http://tvrehberi.hurriyet.com.tr/program-detay/308271/deli-deli-olma)获得();清单<串GT; saatItem =新的ArrayList<串GT;();
        对于(元迭代:doc.getElementsByClass(内容和放大器;明确)){
            saatItem.add(iterable.text());        }
赶上(IOException异常五){
        // TODO自动生成catch块
        e.printStackTrace();
    }

此错误的原因阶层有空白字符。如果类值没有空白字符,code完美运行。
我怎样才能解决这个问题?

错误日志:

  02-06 00:18:53.770:E / AndroidRuntime(28775):致命异常:主要
02-06 00:18:53.770:E / AndroidRuntime(28775):了java.lang.RuntimeException:无法启动活动ComponentInfo {com.example.htmlparsingtutorial / com.example.htmlparsingtutorial.MainActivity}:android.os.NetworkOnMainThreadException
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread.performLaunchActivity(ActivityThread.java:2100)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread.handleLaunchActivity(ActivityThread.java:2125)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread.access $ 600(ActivityThread.java:140)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread $ H.handleMessage(ActivityThread.java:1227)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.os.Handler.dispatchMessage(Handler.java:99)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.os.Looper.loop(Looper.java:137)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread.main(ActivityThread.java:4898)
02-06 00:18:53.770:E / AndroidRuntime(28775):在java.lang.reflect.Method.invokeNative(本机方法)
02-06 00:18:53.770:E / AndroidRuntime(28775):在java.lang.reflect.Method.invoke(Method.java:511)
02-06 00:18:53.770:E / AndroidRuntime(28775):在com.android.internal.os.ZygoteInit $ MethodAndArgsCaller.run(ZygoteInit.java:1006)
02-06 00:18:53.770:E / AndroidRuntime(28775):在com.android.internal.os.ZygoteInit.main(ZygoteInit.java:773)
02-06 00:18:53.770:E / AndroidRuntime(28775):在dalvik.system.NativeStart.main(本机方法)
02-06 00:18:53.770:E / AndroidRuntime(28775):由:android.os.NetworkOnMainThreadException
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.os.StrictMode $ AndroidBlockGuardPolicy.onNetwork(StrictMode.java:1118)
02-06 00:18:53.770:E / AndroidRuntime(28775):在java.net.InetAddress.lookupHostByName(InetAddress.java:385)
02-06 00:18:53.770:E / AndroidRuntime(28775):在java.net.InetAddress.getAllByNameImpl(InetAddress.java:236)
02-06 00:18:53.770:E / AndroidRuntime(28775):在java.net.InetAddress.getAllByName(InetAddress.java:214)
。02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpConnection<&初始化GT;(HttpConnection.java:70)
。02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpConnection<&初始化GT;(HttpConnection.java:50)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpConnection $ Address.connect(HttpConnection.java:340)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpConnectionPool.get(HttpConnectionPool.java:87)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpConnection.connect(HttpConnection.java:128)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpEngine.openSocketConnection(HttpEngine.java:315)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpEngine.connect(HttpEngine.java:310)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpEngine.sendSocketRequest(HttpEngine.java:289)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpEngine.sendRequest(HttpEngine.java:239)
02-06 00:18:53.770:E / AndroidRuntime(28775):在libcore.net.http.HttpURLConnectionImpl.connect(HttpURLConnectionImpl.java:80)
02-06 00:18:53.770:E / AndroidRuntime(28775):在org.jsoup.helper.HttpConnection $ Response.execute(HttpConnection.java:408)
02-06 00:18:53.770:E / AndroidRuntime(28775):在org.jsoup.helper.HttpConnection $ Response.execute(HttpConnection.java:393)
02-06 00:18:53.770:E / AndroidRuntime(28775):在org.jsoup.helper.HttpConnection.execute(HttpConnection.java:159)
02-06 00:18:53.770:E / AndroidRuntime(28775):在org.jsoup.helper.HttpConnection.get(HttpConnection.java:148)
02-06 00:18:53.770:E / AndroidRuntime(28775):在com.example.htmlparsingtutorial.MainActivity.onCreate(MainActivity.java:90)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.Activity.performCreate(Activity.java:5206)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.Instrumentation.callActivityOnCreate(Instrumentation.java:1083)
02-06 00:18:53.770:E / AndroidRuntime(28775):在android.app.ActivityThread.performLaunchActivity(ActivityThread.java:2064)
02-06 00:18:53.770:E / AndroidRuntime(28775):... 11个


解决方案

是的,如果你运行在主线程网络code此异常。
该解决方案是运行你的任务作为 AsyncTask的

BTW。请测试一下,code,在PC上它的工作原理:

 文档的DOC = Jsoup.connect(http://tvrehberi.hurriyet.com.tr/program-detay/308271/deli-deli-olma)获得();清单<串GT; saatItem =新的ArrayList<串GT;();对于(element元素:doc.select(DIV [CLASS =内容清]))
{
    saatItem.add(element.text());
}

I want parsing html site and get a string value. But i receive error when parsing div class.

<div class="content clear">

I wrote above code but i received error.

try {
        doc = Jsoup.connect("http://tvrehberi.hurriyet.com.tr/program-detay/308271/deli-deli-olma").get();

List<String> saatItem = new ArrayList<String>();
        for (Element  iterable : doc.getElementsByClass("content&clear")) {
            saatItem.add(iterable.text());

        }
catch (IOException e) {
        // TODO Auto-generated catch block
        e.printStackTrace();
    }

This error cause class have blank character. If class value hasn't blank character, code runs perfectly. How can i solve this problem ?

Error logs :

02-06 00:18:53.770: E/AndroidRuntime(28775): FATAL EXCEPTION: main
02-06 00:18:53.770: E/AndroidRuntime(28775): java.lang.RuntimeException: Unable to start activity ComponentInfo{com.example.htmlparsingtutorial/com.example.htmlparsingtutorial.MainActivity}: android.os.NetworkOnMainThreadException
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread.performLaunchActivity(ActivityThread.java:2100)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread.handleLaunchActivity(ActivityThread.java:2125)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread.access$600(ActivityThread.java:140)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread$H.handleMessage(ActivityThread.java:1227)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.os.Handler.dispatchMessage(Handler.java:99)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.os.Looper.loop(Looper.java:137)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread.main(ActivityThread.java:4898)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at java.lang.reflect.Method.invokeNative(Native Method)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at java.lang.reflect.Method.invoke(Method.java:511)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at com.android.internal.os.ZygoteInit$MethodAndArgsCaller.run(ZygoteInit.java:1006)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at com.android.internal.os.ZygoteInit.main(ZygoteInit.java:773)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at dalvik.system.NativeStart.main(Native Method)
02-06 00:18:53.770: E/AndroidRuntime(28775): Caused by: android.os.NetworkOnMainThreadException
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.os.StrictMode$AndroidBlockGuardPolicy.onNetwork(StrictMode.java:1118)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at java.net.InetAddress.lookupHostByName(InetAddress.java:385)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at java.net.InetAddress.getAllByNameImpl(InetAddress.java:236)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at java.net.InetAddress.getAllByName(InetAddress.java:214)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpConnection.<init>(HttpConnection.java:70)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpConnection.<init>(HttpConnection.java:50)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpConnection$Address.connect(HttpConnection.java:340)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpConnectionPool.get(HttpConnectionPool.java:87)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpConnection.connect(HttpConnection.java:128)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpEngine.openSocketConnection(HttpEngine.java:315)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpEngine.connect(HttpEngine.java:310)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpEngine.sendSocketRequest(HttpEngine.java:289)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpEngine.sendRequest(HttpEngine.java:239)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at libcore.net.http.HttpURLConnectionImpl.connect(HttpURLConnectionImpl.java:80)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:408)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at org.jsoup.helper.HttpConnection$Response.execute(HttpConnection.java:393)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at org.jsoup.helper.HttpConnection.execute(HttpConnection.java:159)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at org.jsoup.helper.HttpConnection.get(HttpConnection.java:148)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at com.example.htmlparsingtutorial.MainActivity.onCreate(MainActivity.java:90)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.Activity.performCreate(Activity.java:5206)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.Instrumentation.callActivityOnCreate(Instrumentation.java:1083)
02-06 00:18:53.770: E/AndroidRuntime(28775):    at android.app.ActivityThread.performLaunchActivity(ActivityThread.java:2064)
02-06 00:18:53.770: E/AndroidRuntime(28775):    ... 11 more

解决方案

Yes, this exception is thrown if you run networkcode in the main thread. The solution is to run your task as an AsyncTask.

btw. please test this code, on PC it works:

Document doc = Jsoup.connect("http://tvrehberi.hurriyet.com.tr/program-detay/308271/deli-deli-olma").get();

List<String> saatItem = new ArrayList<String>();

for( Element element : doc.select("div[class=content clear]"))
{
    saatItem.add(element.text());
}

这篇关于使用Jsoup解析HTML时出错的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆