使用Android将Web JavaScript内容解析为字符串 [英] Parsing web javascript content to string using android

查看：40 发布时间：2021/2/14 18:44:55 javascript java android jsoup

本文介绍了使用Android将Web JavaScript内容解析为字符串的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我想将网站内容读成字符串.

I would like to read the content of a website into a string.

我通过使用jsoup开始，如下所示:

I started by using jsoup as follows:

private void getWebsite() {
    new Thread(new Runnable() {
        @Override
        public void run() {
            final StringBuilder builder = new StringBuilder();

            try {

                String query = "https://merhav.nli.org.il/primo-explore/search?tab=default_tab&search_scope=Local&vid=NLI&lang=iw_IL&query=any,contains,הארי פוטר";

                Document doc = Jsoup.connect(query).get();
                String title = doc.title();
                Elements links = doc.select("div");

                builder.append(title).append("\n");

                for (Element link : links) {
                    builder.append("\n").append("Link : ").append(link.attr("href"))
                            .append("\n").append("Text : ").append(link.text());
                }
            } catch (IOException e) {
                builder.append("Error : ").append(e.getMessage()).append("\n");
            }

            runOnUiThread(new Runnable() {
                @Override
                public void run() {
                    tv_result.setText(builder.toString());

                }
            });
        }
    }).start();
}

但是，问题是在该站点中，当我使用诸如chrome这样的网络浏览器时，它在其中一行中显示:

However, the problem is that in this site, when I web browser such as chrome it says in one of it lines:

window.appPerformance.timeStamps['index.html']= Date.now();</script><primo-explore><noscript>JavaScript must be enabled to use the system</noscript><style>.init-message {

所以我读到jsoup对于这种情况没有好的解决方案. 即使使用javascript也有什么好方法来获取此页面的元素?

So I read that jsoup doesn't have a good solution for this case. Is there any good way to get the element of this page even though that it uses javascript?

尝试以下建议后，我使用webView加载了网址，然后使用jsoap对其进行了解析，如下所示:

After trying the suggestions below, I used webView to load the url and then parsed it using jsoap as follows:

wb_result.getSettings().setJavaScriptEnabled(true);
MyJavaScriptInterface jInterface = new MyJavaScriptInterface();
wb_result.addJavascriptInterface(jInterface, "HtmlViewer");

wb_result.setWebViewClient(new WebViewClient() {
    @Override
    public void onPageFinished(WebView view, String url) {
        wb_result.loadUrl("javascript:window.HtmlViewer.showHTML ('<head>'+document.getElementsByTagName('html')[0].innerHTML+'</head>');");
    }
 });

它完成了工作，并确实向我展示了该元素.但是，仍然与浏览器不同，它显示某些行是功能，而不是结果.例如:

It did the job and indeed showed me the element. However, still, unlike a browser, it shows some lines as a function and not as a result. For example:

ng-href="{{::$ctrl.getDeepLinkPath()}}"

是否可以像浏览器一样解析和显示结果?

Is there a way to parse and display the result like in the browser?

谢谢

使用Android将Web JavaScript内容解析为字符串 [英] Parsing web javascript content to string using android

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录关闭

使用Android将Web JavaScript内容解析为字符串 [英] Parsing web javascript content to string using android

问题描述

推荐答案

相关文章

Java开发最新文章

热门教程

热门工具

登录 关闭

登录关闭