
python2.7 - Chinese text is garbled after being written to a file in Python

Date: 2022-09-16 09:17:07

Problem description

A very simple little crawler program:

import urllib2

# L: the list of domains to query (not shown in the question)
for i in L:
    content = urllib2.urlopen('http://X.X.X.X/cgi-bin/GetDomainOwnerInfo?domain=%s' % i)
    html = content.read()
    with open('domain_test.xml', 'a') as f:
        f.write(html)
    print html

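print displays the Chinese text correctly.

But when the XML file is opened directly, the text is garbled.

Windows 7 operating system, Python 2.7

Could anyone tell me how to solve this problem?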

Answers

Answer 1:

You need to know how content is encoded and consider whether it needs to be converted.

You need to open the file as utf-8 and then write to it.
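The two points above can be combined into a minimal sketch: decode the raw bytes with the encoding the server actually used, then write the resulting Unicode text through codecs.open with utf-8. The helper name append_as_utf8, the sample text, and the choice of GBK as the source encoding are assumptions for illustration only; the question does not say which encoding the server returns. The literal bytes stand in for content.read() so the sketch runs without network access.

```python
# -*- coding: utf-8 -*-
import codecs
import os
import tempfile


def append_as_utf8(path, raw_bytes, source_encoding):
    """Decode raw_bytes from source_encoding and append the text to path as UTF-8."""
    text = raw_bytes.decode(source_encoding)  # bytes -> Unicode text
    f = codecs.open(path, 'a', 'utf-8')       # file that encodes to UTF-8 on write
    try:
        f.write(text)
    finally:
        f.close()
    return text


# Stand-in for html = content.read(); GBK is an assumed source encoding.
raw = u'中文'.encode('gbk')
out_path = os.path.join(tempfile.mkdtemp(), 'domain_test.xml')
append_as_utf8(out_path, raw, 'gbk')

# The bytes on disk are now UTF-8, regardless of the source encoding.
with open(out_path, 'rb') as check:
    stored = check.read()
```

If the response is already UTF-8, pass 'utf-8' as source_encoding instead. Decoding with the wrong encoding either raises UnicodeDecodeError or silently produces the wrong characters, so it is worth confirming the server's encoding first.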

codecs.open(filename, mode[, encoding[, errors[, buffering]]])

Open an encoded file using the given mode and return a wrapped version providing transparent encoding/decoding. The default file mode is 'r' meaning to open the file in read mode.

Note: The wrapped version will only accept the object format defined by the codecs, i.e. Unicode objects for most built-in codecs. Output is also codec-dependent and will usually be Unicode as well.

Note: Files are always opened in binary mode, even if no binary mode was specified. This is done to avoid data loss due to encodings using 8-bit values. This means that no automatic conversion of '\n' is done on reading and writing.

encoding specifies the encoding which is to be used for the file.

errors may be given to define the error handling. It defaults to 'strict' which causes a ValueError to be raised in case an encoding error occurs.

buffering has the same meaning as for the built-in open() function. It defaults to line buffered.
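A small sketch of the errors parameter described above, assuming an ASCII-encoded target file so that Chinese text cannot be encoded. The file names and sample text are made up for illustration. UnicodeEncodeError is a subclass of ValueError, which is consistent with the documented behavior of 'strict'.

```python
# -*- coding: utf-8 -*-
import codecs
import os
import tempfile

tmp_dir = tempfile.mkdtemp()
text = u'中文'

# errors defaults to 'strict': unencodable text raises a ValueError subclass.
strict_failed = False
f = codecs.open(os.path.join(tmp_dir, 'strict.txt'), 'w', 'ascii')
try:
    f.write(text)
except ValueError:  # UnicodeEncodeError derives from ValueError
    strict_failed = True
finally:
    f.close()

# errors='replace' substitutes '?' for each character that cannot be encoded.
replace_path = os.path.join(tmp_dir, 'replace.txt')
f = codecs.open(replace_path, 'w', 'ascii', 'replace')
try:
    f.write(text)
finally:
    f.close()

with open(replace_path, 'rb') as check:
    replaced = check.read()
```

With 'utf-8' as the encoding, as in the answer, neither case arises for Chinese text, because UTF-8 can encode every Unicode character.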

import codecs
f = codecs.open('domain_test.xml', 'w', 'utf-8')

Answer 2:

Try adding # -*- coding: utf-8 -*- at the top of the file.

Answer 3:

Add #coding:utf-8 at the top of the file.
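For context on both suggestions: a coding declaration tells the Python interpreter how to decode the source file itself (PEP 263). On its own it does not re-encode the bytes returned by content.read() or change what f.write(html) puts on disk, so it may need to be combined with an explicit decode and encode step. The sketch below, using made-up sample text, shows that the same Unicode text produces different bytes depending on the encoding chosen at write time, independent of the declaration.

```python
# -*- coding: utf-8 -*-
# The line above only declares how this source file is encoded.

text = u'中文'  # decoded by the interpreter according to the declaration

utf8_bytes = text.encode('utf-8')  # 3 bytes per character for this text
gbk_bytes = text.encode('gbk')     # 2 bytes per character for this text

# Same text, different bytes: reading one encoding as the other gives garbled output.
misread = utf8_bytes.decode('gbk', 'replace')
```

This is one plausible way to end up with garbled text: a viewer that assumes one encoding opens a file whose bytes were written in another.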
