加载word2vec模型时出现数组重整错误

问题描述

我有以下代码:

from gensim.models import Word2Vec
model = Word2Vec.load('model2')
X = model[model.wv.vocab]

这段代码可以在我的一台机器上运行,但不能在另一台机器上运行。模型文件是相同的。这是怎么回事?我收到的错误消息如下:

  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/gensim/models/word2vec.py",line 1330,in load
    model = super(Word2Vec,cls).load(*args,**kwargs)
  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/gensim/models/base_any2vec.py",line 1244,in load
    model = super(BaseWordEmbeddingsModel,line 603,in load
    return super(BaseAny2VecModel,cls).load(fname_or_handle,**kwargs)
  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/gensim/utils.py",line 427,in load
    obj._load_specials(fname,mmap,compress,subname)
  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/gensim/utils.py",line 458,in _load_specials
    getattr(self,attrib)._load_specials(cfname,line 469,in _load_specials
    val = np.load(subname(fname,attrib),mmap_mode=mmap)
  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/numpy/lib/npyio.py",line 440,in load
    pickle_kwargs=pickle_kwargs)
  File "/home/ec2-user/miniconda3/envs/word2vec/lib/python3.7/site-packages/numpy/lib/format.py",line 771,in read_array
    array.shape = shape
ValueError: cannot reshape array of size 16777184 into shape (134441,128)

要安装gensim,我使用了conda install -c anaconda gensim

解决方法

我检查了@gojomo在评论中所指的内容,他是正确的,我的文件大小不正确。上传期间一定发生了一些事情。对于大型模型,word2vec将模型保存为3个文件。假设您的模型名称是“ model2”,您将拥有:

  1. model2
  2. model2.trainables.syn1neg.npy
  3. model2.wv.vectors.npy

我的.wv.vectors.npy比其他机器上的版本小了几千字节。

相关问答

错误1:Request method ‘DELETE‘ not supported 错误还原:...
错误1:启动docker镜像时报错:Error response from daemon:...
错误1:private field ‘xxx‘ is never assigned 按Alt...
报错如下,通过源不能下载,最后警告pip需升级版本 Requirem...