文章詳情頁

python2.7 - python 中文寫入文件后亂碼

瀏覽：122日期：2022-09-16 09:17:07

問題描述

一個很簡單的小爬蟲程序

for i in L:content = urllib2.urlopen(’http://X.X.X.X/cgi-bin/GetDomainOwnerInfo?domain=%s’ %i)html = content.read()with open(’domain_test.xml’,’a’) as f: f.write(html) print html

print 的結果是中文：

但直接打開xml文本的時候卻是亂碼：

Windows 7 操作系統，python 2.7

請問一下各位，這個問題如何解決？

問題解答

回答1：

你需要知道 content 的編碼方式，并考慮是否要轉換

你需要用 utf-8 打開文件，然后寫入

codecs.open(filename, mode[, encoding[, errors[, buffering]]])

Open an encoded file using the given mode and return a wrapped versionproviding transparent encoding/decoding. The default file mode is ’r’meaning to open the file in read mode.

Note The wrapped version will only accept the object format defined bythe codecs, i.e. Unicode objects for most built-in codecs. Output isalso codec-dependent and will usually be Unicode as well. Note Filesare always opened in binary mode, even if no binary mode was specified. This is done to avoid data loss due to encodings using8-bit values. This means that no automatic conversion of ’n’ is doneon reading and writing. encoding specifies the encoding which is to beused for the file.errors may be given to define the error handling. It defaults to’strict’ which causes a ValueError to be raised in case an encodingerror occurs.buffering has the same meaning as for the built-in open() function. Itdefaults to line buffered.

import codecsf = codecs.open('domain_test.xml', 'w', 'utf-8')回答2：

試試在文件開頭加上 # -*- coding: utf-8 -*-

回答3：

在文件開頭加上 #coding:utf-8

Python 編程

上一條：【python|scapy】sprintf輸出時raw_string轉string下一條：python - 能通過CAN控制一部普通的家用轎車嗎？

相關文章：

1. python - 我在使用pip install -r requirements.txt下載時，為什么部分能下載，部分不能下載2. mysql - jdbc的問題3. python - 如何正則字符串中的所有漢字4. mysql - 分庫分表、分區、讀寫分離這些都是用在什么場景下，會帶來哪些效率或者其他方面的好處5. mysql - 如何減少使用或者不用LEFT JOIN查詢？6. python - 編碼問題求助7. mysql 5個left關鍵然后再用搜索條件幾千條數據就會卡，如何解決呢8. 視頻文件不能播放，怎么辦？9. python - oslo_config10. 圖片鏈接的地址怎么獲得的

排行榜

					
					python - Win7調用flup報錯’module’ object has no attribute ’fromfd’
javascript - npm安裝警告
javascript - 關于css絕對定位在ios瀏覽器被橡皮筋遮擋的問題
python - 小白django提交數據后，沒有存儲到數據庫（查閱資料并沒有發現問題）
python - 如何正則字符串中的所有漢字
python - 我在使用pip install -r requirements.txt下載時，為什么部分能下載，部分不能下載
docker安裝后出現Cannot connect to the Docker daemon.
Docker for Mac 創建的dnsmasq容器連不上/不工作的問題
docker內創建jenkins訪問另一個容器下的服務器問題
debian - docker依賴的aufs-tools源碼哪里可以找到啊？
視頻文件不能播放，怎么辦？
				

熱門標簽