且构网

Extracting data from a website

Updated: 2022-05-23 09:26:52

I believe the problem is that these values are rendered by JavaScript code, which your browser runs and urllib does not. You should use a library that can execute JavaScript.
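To make the point concrete, here is a minimal sketch of what a plain HTTP fetch sees. The page source, the `price` id, and the helper `text_by_id` are all made up for illustration; the idea is only that the raw HTML carries an empty placeholder plus a script, and nothing runs that script unless a browser engine is involved.

```python
from html.parser import HTMLParser

# Hypothetical page source, as urllib would receive it: the value is only
# filled in when a browser executes the <script>.
RAW_HTML = """
<html><body>
  <span id="price"></span>
  <script>document.getElementById("price").textContent = "42.50";</script>
</body></html>
"""


class TextById(HTMLParser):
    """Collect the text inside the first element that has the given id."""

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0      # > 0 while we are inside the target element
        self.done = False   # True once the target element has been closed
        self.text = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if self.depth:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth:
            self.text.append(data)


def text_by_id(html, target_id):
    """Return the stripped text of the element with target_id ('' if empty)."""
    parser = TextById(target_id)
    parser.feed(html)
    return "".join(parser.text).strip()


# The static HTML has no value in the placeholder, because no script ran.
print(repr(text_by_id(RAW_HTML, "price")))  # → ''
```

The parser is deliberately simple (it does not special-case void tags such as `<br>`), which is enough to show that the value never appears in the downloaded markup.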

I searched Google for "crawler python javascript" and found some *** questions and answers that recommend using selenium or webkit. You can use those libraries through scrapy. Here are two snippets:

Rendered/interactive javascript with gtk/webkit/jswebkit

Rendered Javascript Crawler With Scrapy and Selenium RC
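The snippets themselves are not reproduced here, so the following is only a rough sketch of the Selenium route. It uses the current Selenium WebDriver API with headless Chrome rather than Selenium RC, and without scrapy. The function name `fetch_rendered_text`, the element id, and the URL in the usage comment are assumptions for illustration. It also assumes the `selenium` package and a Chrome browser are installed.

```python
def fetch_rendered_text(url, element_id, timeout=10):
    """Open url in headless Chrome and return the text of element_id
    once JavaScript has filled it in.

    Raises selenium.common.exceptions.TimeoutException if the element
    is still missing or empty after `timeout` seconds.
    """
    # Imported inside the function so that defining it does not require
    # Selenium; only calling it does.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By
    from selenium.webdriver.support.ui import WebDriverWait

    options = Options()
    options.add_argument("--headless")  # run without opening a window
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        # Poll until the element exists and has non-empty text.
        # WebDriverWait ignores NoSuchElementException by default and
        # returns the first truthy value produced by the callable.
        return WebDriverWait(driver, timeout).until(
            lambda d: d.find_element(By.ID, element_id).text.strip()
        )
    finally:
        driver.quit()  # always release the browser process


# Hypothetical usage (URL and id are placeholders):
# print(fetch_rendered_text("https://example.com/quote", "price"))
```

Waiting for the element's text, rather than only for the page to load, matters when the value arrives through a later script or AJAX call. Inside a scrapy project the same call would typically sit in a downloader middleware or in the spider callback, which is roughly what the snippet titles above suggest.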