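Checking URL Status with Python

I recently got a request to check the status of a list of URLs with Python and append every URL that returns 200 to a file. The three examples below show different ways to do it.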

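Requirement: read URLs from a text file (one per line), request each one with Python, and append the URLs whose HTTP status code is 200 to an output file.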
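
Before the full scripts, here is a minimal sketch of the "filter and append" half of the requirement, kept separate from the network call so it can be tried offline. The names `save_ok_urls` and `get_status` are hypothetical and used only for illustration, and the URLs and status codes in the demo are made up; any function that takes a URL and returns a status code (or None on failure) could be passed in.

```python
import os
import tempfile


def save_ok_urls(urls, out_path, get_status):
    """Append every URL whose status is 200 to out_path; return the URLs saved.

    get_status is a hypothetical callable: it takes a URL string and returns
    an integer status code, or None when the request failed.
    """
    saved = []
    with open(out_path, 'a') as out:      # 'a' appends instead of overwriting
        for raw in urls:
            url = raw.strip()             # drop the trailing newline and spaces
            if not url:
                continue                  # skip blank lines
            if get_status(url) == 200:
                out.write(url + '\n')
                saved.append(url)
    return saved


if __name__ == '__main__':
    # Stand-in checker with made-up results, so the sketch runs without a network.
    fake = {'http://a.example/': 200, 'http://b.example/': 404}
    path = os.path.join(tempfile.mkdtemp(), '200.txt')
    print(save_ok_urls(['http://a.example/\n', '\n', 'http://b.example/\n'],
                       path, fake.get))
```

Stripping each line before the request matters: a URL that still carries its newline may fail or be written back with a doubled line break.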

Code 1:

#!/usr/bin/env python
# coding=utf-8
import requests


def getHttpStatusCode(url):
    """Return the HTTP status code for url, or None if the request fails."""
    try:
        # The 10-second timeout is an arbitrary choice; adjust as needed
        response = requests.get(url, timeout=10)
        return response.status_code
    except requests.exceptions.RequestException as e:
        print(e)
        return None


if __name__ == "__main__":
    with open('1.txt', 'r') as src:
        for line in src:
            url = line.strip()  # remove the trailing newline
            if not url:
                continue
            status = getHttpStatusCode(url)
            if status == 200:
                # Append URLs that returned 200 to the result file
                with open('200.txt', 'a') as out:
                    out.write(url + '\n')
                print(url)
            else:
                print('no 200 code')

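Code 2:

#!/usr/bin/env python
# -*- coding: utf-8 -*-
import requests


def request_status(line):
    """Append line to url_200.txt if it answers with 200; return it, else None."""
    # The 10-second timeout is an arbitrary choice; adjust as needed
    conn = requests.get(line, timeout=10)
    if conn.status_code == 200:
        with open('url_200.txt', 'a') as f:
            f.write(line + '\n')
        return line
    return None


if __name__ == '__main__':
    with open('/1.txt', 'r') as f:
        for line in f:
            try:
                purge_url = request_status(line.strip())
            except Exception as e:
                # Report failed requests instead of silently ignoring them
                print(e)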

Code 3:

#!/usr/bin/env python
# coding: utf-8
import linecache
import urllib.request

for x in linecache.updatecache(r'1.txt'):
    url = x.strip()  # remove the trailing newline
    if not url:
        continue
    try:
        a = urllib.request.urlopen(url).getcode()
    except Exception as e:
        print(e)
        continue  # no status code available, move on to the next URL
    if a == 200:
        # Alternative: collect the URLs in a list, sort it, and write 2.txt once
        # File modes: 'r' read-only, 'w' write (overwrite), 'a' append
        with open('200urllib.txt', 'a') as f:
            f.write(url + '\n')
    else:
        print('error')
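
One detail worth knowing if the urllib approach is run on Python 3: `urllib.request.urlopen` raises `urllib.error.HTTPError` for error responses such as 404 or 500, so the status code has to be read from the exception rather than from `getcode()`. The sketch below shows one way to handle that. The helper name `url_status` and the 10-second default timeout are my own assumptions, and the exception list covers the common failures rather than every possible one.

```python
import urllib.error
import urllib.request


def url_status(url, timeout=10):
    """Return the HTTP status code of url, or None if no response arrived.

    url_status is a hypothetical helper name; timeout=10 is an arbitrary default.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.getcode()
    except urllib.error.HTTPError as e:
        # The server answered, but with an error status such as 404 or 500
        return e.code
    except (urllib.error.URLError, ValueError, OSError):
        # DNS failure, refused connection, timeout, or a malformed URL
        return None
```

With a helper like this, the caller can compare the return value against 200 and treat None as "could not connect", which keeps the loop free of try/except blocks.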

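Summary

That covers three ways to check URL status with Python and save the URLs that return 200. I hope it helps. If you have any questions, leave a comment and I will reply as soon as I can.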
