ÃֽŠ°Ô½Ã±Û(JAVA)
2017.07.10 / 17:12
Java - Obtain text within script tag using Jsoup
XMaLL°ü¸®ÀÚ
Ãßõ ¼ö 251
I am using the Jsoup library to read a URL. This url has text within a few <script>
tags. Is it possible for me to obtain the text within each <script>
tag? Please note that I am not asking to parse a Javascript file as I am already aware JSoup does not allow that. The actual source code of the URL has text within a script tag, I need that.
doc = Jsoup.connect("http://www.example.com").timeout(10000).get();
Element div = doc.select("script").first();
for (Element element : div.children()) {
System.out.println(element.toString());
}
This is what one of the script tags look like from the source code:
<script type="text/javascript">
(function() {
...
})();
</script>
Yes. You can use Element#getElementsByTag() to get all the script
tag . Each script tags will be represented by the DataNode.
Document doc =Jsoup.connect("http://stackoverflow.com/questions/16780517/java-obtain-text-within-script-tag-using-jsoup").timeout(10000).get();
Elements scriptElements = doc.getElementsByTag("script");
for (Element element :scriptElements ){
for (DataNode node : element.dataNodes()) {
System.out.println(node.getWholeData());
}
System.out.println("-------------------");
}