Python: Installing Test Packages with nltk download (Example)

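This follows on from the previous article, "Machine learning: nltk download fails with Error connecting to server: [Errno -2]". It covers how to install the NLTK test packages and what to watch out for.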

>>> import nltk

>>> nltk.download()

NLTK Downloader

---------------------------------------------------------------------------

d) Download l) List c) Config h) Help q) Quit

---------------------------------------------------------------------------

Downloader> d

Download which package (l=list; x=cancel)?

Identifier>

---------------------------------------------------------------------------

d) Download l) List c) Config h) Help q) Quit

---------------------------------------------------------------------------

Note: at this step, choose l (list).

Downloader> l

Packages:

[ ] brown_tei........... brown Corpus (TEI XML Version)

[ ] punkt............... Punkt Tokenizer Models

[ ] maxent_treebank_pos_tagger Treebank Part of Speech Tagger (Maximum entropy)

[ ] machado............. Machado de Assis -- Obra Completa

[ ] movie_reviews....... Sentiment Polarity Dataset Version 2.0

[ ] names............... Names Corpus, Version 1.3 (1994-03-29)

[ ] nombank.1.0......... NomBank Corpus 1.0

[ ] nps_chat............ NPS Chat

[ ] paradigms........... Paradigm Corpus

[ ] pe08................ Cross-Framework and Cross-Domain Parser

Evaluation Shared Task

[ ] pil................. The Patient information Leaflet (PIL) Corpus

[ ] pl196x.............. Polish language of the XX century sixties

[ ] ppattach............ Prepositional Phrase Attachment Corpus

[ ] problem_reports..... Problem Report Corpus

[ ] propbank............ Proposition Bank Corpus 1.0

[ ] qc.................. Experimental Data for Question Classification

[ ] reuters............. The Reuters-21578 benchmark corpus, ApteMod

version

[ ] rte................. PASCAL RTE Challenges 1, 2, and 3

Hit Enter to continue:

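Page through the full list and find the package you need. Then, rather than typing the identifier at the downloader prompt as instructed, pass it directly to nltk.download: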

>>> nltk.download('brown_tei')
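The same call can also be scripted rather than typed at the prompt. The following is a minimal sketch, not part of the original session: it relies on `nltk.download` returning `True` on success and `False` on failure, and retries a few times in case the failure is transient. The helper name `download_with_retry` and its `attempts`, `delay`, and `download` parameters are hypothetical, introduced only for illustration.

```python
import time


def download_with_retry(package_id, attempts=3, delay=2.0, download=None):
    """Try to download one NLTK package, retrying when the call reports failure.

    `download` defaults to nltk.download. It is a parameter only so that this
    sketch can be exercised without network access (an assumption made here,
    not something the original article does).
    """
    if download is None:
        import nltk  # imported lazily so the helper loads even without NLTK

        download = nltk.download

    for attempt in range(1, attempts + 1):
        # nltk.download returns True on success and False on failure.
        if download(package_id, quiet=True):
            return True
        if attempt < attempts:
            time.sleep(delay)  # short pause before the next attempt
    return False
```

Calling `download_with_retry('brown_tei')` would then attempt the same download as the line above. A retry only helps with temporary network problems; it will not fix a persistent name-resolution failure.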

Note: this method may fail with the error <urlopen error [Errno -2] Name or service not known>. In that case, run the downloader from the command line instead:

python -m nltk.downloader spanish_grammars
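If this command needs to be run from a script, for example during environment setup, it can be launched as a child process. The sketch below is illustrative only: `build_downloader_command` and `run_downloader` are hypothetical helper names, and the optional `-d` flag (the downloader's target-directory option) is an addition that the original command does not use.

```python
import subprocess
import sys


def build_downloader_command(package_ids, download_dir=None):
    """Build the argument list for `python -m nltk.downloader`."""
    # sys.executable ensures the same interpreter (and its NLTK install) is used.
    command = [sys.executable, "-m", "nltk.downloader"]
    if download_dir is not None:
        command += ["-d", download_dir]  # optional target directory
    command += list(package_ids)
    return command


def run_downloader(package_ids, download_dir=None):
    """Run the downloader in a child process and return the CompletedProcess."""
    return subprocess.run(build_downloader_command(package_ids, download_dir))
```

For example, `run_downloader(['spanish_grammars'])` would run the same command as shown above, using the current interpreter.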
