打印导致LangDetectException的行

问题描述

我无法进行语言检测。 下面的代码引发异常错误

from langdetect import detect
for row in df['Comments']:
    text = str(row)
    language_code = detect(text)
    sentence = [all_languages_codes.get(language_code)]
    df['Language']=sentence[0]

错误消息:

    148         ngrams = self._extract_ngrams()
    149         if not ngrams:
--> 150             raise LangDetectException(ErrorCode.CantDetectError,'No features in text.')
    151 
    152         self.langprob = [0.0] * len(self.langlist)

LangDetectException: No features in text.

如何打印出导致LangDetectException的行?

解决方法

您的目录字符串似乎为空:

detect("")
LangDetectException: No features in text.

您可以启动调试器或交互式外壳程序来确定,将其包装在try / except块中,并在引发异常时启动调试器:

from langdetect import detect
for row in df['Comments']:
    try:
        text = str(row)
        language_code = detect(text)
        sentence = [all_languages_codes.get(language_code)]
        df['Language']=sentence[0]
    except Exception:
        import ipdb; ipdb.set_trace()