文章詳情頁

python處理文件內容的正確姿勢該怎樣？

瀏覽：97日期：2022-08-23 11:36:53

問題描述

大神們：

我想把htm文件中的第一個<link到第二個<link之間的所有內容另存為一個htm該怎么寫比較簡潔。

<meta http-equiv='X-UA-Compatible' content='IE=edge'><link rel='prefetch' ><meta name='application-name' content='Python.org'><meta name='msapplication-tooltip' content='The official home of the Python Programming Language'><meta name='apple-mobile-web-app-title' content='Python.org'><meta name='apple-mobile-web-app-capable' content='yes'><meta name='apple-mobile-web-app-status-bar-style' content='black'><meta name='viewport' content='width=device-width, initial-scale=1.0'><meta name='HandheldFriendly' content='True'><meta name='format-detection' content='telephone=no'><meta http-equiv='cleartype' content='on'><meta http-equiv='imagetoolbar' content='false'><script type='text/javascript' async='' src='https://ssl.google-analytics.com/ga.js'></script><script src='http://www.wxshucaidpc.com/wenda/Welcome to Python.org_files/modernizr.js.下載'></script><style type='text/css' adt='123'></style><link href='http://www.wxshucaidpc.com/wenda/Welcome to Python.org_files/style.css' rel='stylesheet' type='text/css'><link href='http://www.wxshucaidpc.com/wenda/Welcome to Python.org_files/mq.css' rel='stylesheet' type='text/css' media='not print, braille, embossed, speech, tty'>

提取的內容應該是：

<link rel='prefetch' ><meta name='application-name' content='Python.org'><meta name='msapplication-tooltip' content='The official home of the Python Programming Language'><meta name='apple-mobile-web-app-title' content='Python.org'><meta name='apple-mobile-web-app-capable' content='yes'><meta name='apple-mobile-web-app-status-bar-style' content='black'><meta name='viewport' content='width=device-width, initial-scale=1.0'><meta name='HandheldFriendly' content='True'><meta name='format-detection' content='telephone=no'><meta http-equiv='cleartype' content='on'><meta http-equiv='imagetoolbar' content='false'><script type='text/javascript' async='' src='https://ssl.google-analytics.com/ga.js'></script><script src='http://www.wxshucaidpc.com/wenda/Welcome to Python.org_files/modernizr.js.下載'></script><style type='text/css' adt='123'></style><link

問題解答

回答1：

import retext = ''with open('read.html', 'r') as rf: text = rf.read() pattern = r'<link[sS]*?<link'results = re.findall(pattern, text)if results: r = results[0] with open('write.html', 'w') as wf:wf.write(r) ================================================with open('read.html', 'r') as rf: with open('write.html', 'w') as wf:num = 0for line in rf.readlines(): if line.startswith('<link'):num += 1continue if num == 2:break wf.writelines(line)

Python 編程

上一條：python 如何讓字符串的不具有轉義的反斜杠具有轉義功能下一條：關于python切片的問題

相關文章：

1. HTML表單操作標簽調用父相對URL2. java - mac下配置ndk環境變量3. 為啥最大化個窗口還得找一堆理由?4. javascript - 根據不同數據顯示不同內容5. css3 - 如圖的flex骰子布局是怎么實現的？6. java - new + 類名，一定需要申明一個對象嗎？7. node.js - 用nodejs 的node-xlsx模塊去讀取excel中的數據，可是讀取出來的日期是數字，請問該如何讀取日期呢？8. javascript - 解釋下這種函數定義9. javascript - 在 vue里面用import引入js文件，結果為undefined10. css - psd設計稿給的是1920寬的，而我的電腦是1600寬的，那我在寫代碼時，是不是每個寬度都要計算調整

排行榜

					
					基于Spring MVC Java的配置無法正常工作控制臺顯示無錯誤，但我的jsp頁面未顯示
javascript -  解釋下這種函數定義
nignx - docker內nginx 80端口被占用
docker api 開發的端口怎么獲??？
5. java - mac下配置ndk環境變量
javascript - 根據不同數據顯示不同內容
css - psd設計稿給的是1920寬的，而我的電腦是1600寬的，那我在寫代碼時，是不是每個寬度都要計算調整
java - new + 類名，一定需要申明一個對象嗎？
javascript -  在 vue里面用import引入js文件，結果為undefined
node.js - 用nodejs 的node-xlsx模塊去讀取excel中的數據，可是讀取出來的日期是數字，請問該如何讀取日期呢？
				

熱門標簽

国产综合久久一区二区三区