更新时间:2023-02-10 20:12:54
使用正则表达式从中提取值HTML总是一个错误。 HTML语法要复杂得多,它可能首先出现,并且页面很容易捕捉到一个非常复杂的正则表达式。
Using regular expressions to pull values from HTML is always a mistake. HTML syntax is a lot more complex that it may first appear and it's very easy for a page to catch out even a very complex regular expression.
使用 HTML Parser 。另请参见有哪些优缺点领先的Java HTML解析器?