且构网

csv - How do I save scraped web page data to multiple columns with Python?

Updated: 2022-12-15 22:27:39

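Writing a CSV file is fairly simple. Your data needs to be structured as a list of rows, one inner list per line, like this: [["1. 東森新聞雲","新聞"],["2. 創世黎明(Dawn of world)","遊戲"]]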
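As a minimal sketch of why that row shape matters, the snippet below passes the two sample rows to the standard library's `csv.writer`. The helper name `rows_to_csv_text` is hypothetical, and an in-memory `io.StringIO` buffer stands in for a real file so the result can be inspected directly.

```python
import csv
import io

# Sample rows copied from the answer: one inner list per CSV line.
rows = [
    ["1. 東森新聞雲", "新聞"],
    ["2. 創世黎明(Dawn of world)", "遊戲"],
]


def rows_to_csv_text(rows):
    """Hypothetical helper: serialize a list of rows to CSV text."""
    buffer = io.StringIO()
    # writerows() emits one CSV line per inner list and quotes fields when needed.
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue()


csv_text = rows_to_csv_text(rows)
print(csv_text)
```

To write to disk instead, open the target with `open(path, 'w', newline='', encoding='utf-8')` and pass that file object to `csv.writer`. If the file will be opened in Excel, `encoding='utf-8-sig'` is usually the safer choice for Chinese text.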

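from urllib.request import urlopen  # Python 3; in Python 2 this was `from urllib import urlopen`
from bs4 import BeautifulSoup
import re
import csv

html = urlopen("http://www.app12345.com/?area=tw&store=Apple%20Store")
# Name the parser explicitly so bs4 does not have to guess one.
bs0bj = BeautifulSoup(html, "html.parser")
# Collect the two columns in page order.
GPnameList = [name.get_text() for name in bs0bj.find_all("dd", {"class": re.compile("ddappname")})]
GPcompanyname = [cpa.get_text() for cpa in bs0bj.find_all("dd", {"style": re.compile("color")})]

# zip() pairs the two lists into rows; csv.writer quotes any field that contains a comma.
with open('C:/Users/sa/Desktop/0217.csv', 'w', newline='', encoding='utf-8') as f:
    csv.writer(f).writerows(zip(GPnameList, GPcompanyname))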
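
One thing to watch when pairing two scraped lists with `zip`: it stops at the shorter list, so if the page yields fewer matches for one selector, trailing rows are silently dropped. The sketch below uses made-up lists (not data from the site) to show the difference from `itertools.zip_longest`, which pads instead.

```python
from itertools import zip_longest

# Made-up lists standing in for the two scraped columns; the second is one short.
names = ["app A", "app B", "app C"]
categories = ["news", "games"]

# zip() stops at the shorter list, so "app C" is dropped.
truncated = list(zip(names, categories))

# zip_longest() keeps every row and pads the missing cell with fillvalue.
padded = list(zip_longest(names, categories, fillvalue=""))

print(truncated)
print(padded)
```

Comparing `len(GPnameList)` with `len(GPcompanyname)` before writing is a cheap way to find out whether the two selectors really line up.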