且构网

分享程序员开发的那些事...
且构网 - 分享程序员编程开发的那些事

python 3.1.3支持unicode在csv模块?

更新时间:2023-11-10 16:33:40

Python 3绝对支持unicode。我的猜测是,当您打开CSV文件进行阅读时,您指定了错误(或不?)编码。请参阅: http://docs.python.org/release/3.1 .3 / library / functions.html#open



并尝试类似的操作:

  reader = csv.reader(open(foo.csv,encoding =utf-8))

编辑:如果您使用的是Python 2.6,您可以通过以下方式获得相同的结果:

  import codecs 
reader = csv.reader(codecs.open(foo.csv,encoding =utf-8))
$ p>

如果你得到空字节,你的文件可能会使用utf-16 ,所以如果文件不能使用utf-8解码,请尝试。


I have been using python 2.6. While I was writing a python program to process the query result ( in csv format ) from sql server. I found it does not support unicode.

When I run the program with csv file, a error poped up saying:

    for row in csvReader:
Error: line contains NULL byte

After I save the csv file in ANSI/ASCII format with Ultraedit, the program is running okay.

I tried to include the encoding option, but it failed:

csvReader = csv.reader(open(fname, mode='rb', encoding='unicode'), delimiter=',')
TypeError: 'encoding' is an invalid keyword argument for this function

csvReader = csv.reader(open(fname, mode='rb', encoding='utf-8'), delimiter=',')
TypeError: 'encoding' is an invalid keyword argument for this function

I wonder if python 3 support this unicode reading. It can save me a lot of work.

Python 3 definitely supports unicode. My guess is that you specified the wrong (or no?) encoding when you opened the CSV file for reading. See: http://docs.python.org/release/3.1.3/library/functions.html#open

And try something like:

reader = csv.reader(open("foo.csv", encoding="utf-8"))

Edit: If you are using Python 2.6, you can achieve the same result with:

import codecs
reader = csv.reader(codecs.open("foo.csv", encoding="utf-8"))

HOWEVER if you're getting null bytes, your file may be encoded using "utf-16", so try that if the file can't be decoded using utf-8.