仅从文本文件中查找难以匹配的url

2024-06-16 11:02:08 发布

您现在位置:Python中文网/ 问答频道 /正文

我的文本文件包括:

http://www.makemytrip.com/
http://www.makemytrip.com/blog/dil-toh-roaming-hai?intid=Blog_HPHeader_Logo   //how do i remove /dil-toh-roaming-hai?intid=Blog_HPHeader_Logo 
http://www.makemytrip.com/rewards/?intid=New_ch_mtr_na
javascript:void(0)       //how do i remove this 
javascript:void(0)
javascript:void(0)
http://www.makemytrip.com/rewards/?intid=new_ch_mtr_dropdwn
https://support.makemytrip.com/MyAccount/MyTripReward/DashBoard
https://support.makemytrip.com/MyAccount/User/User
https://support.makemytrip.com/MyAccount/MyBookings/BookingSummary/
https://support.makemytrip.com/customersupports.aspx?actiontype=PRINTETICKET

如何只检查url并将它们保存在另一个文件中,以便一次只能解析一个url。我尝试了这个Python代码,但是它匹配并且只打开了第一个url。你知道吗

 import urllib

 with open("s.txt","r") as file:
 for line in file:
    url = urllib.urlopen(line)
    read = url.read()
    print read

Tags: httpscomhttpurlsupportreadwwwjavascript