尝试创建词汇表时请求错误

问题描述

我想使用Google的操作指南https://cloud.google.com/translate/docs/advanced/glossary#unidirectional_glossary

中的示例命令创建一个单向词汇表,以与我的翻译项目一起使用

没有使用python代码创建单向词汇表的示例,仅适用于等效的集合词汇表,而且我不知道代码中要更改什么。

我创建了一个存储分区并上传了我的词汇表文件

然后我尝试在powershell中执行此命令:

$cred = gcloud auth application-default print-access-token
$headers = @{ "Authorization" = "Bearer $cred" }

Invoke-WebRequest `
  -Method POST `
  -Headers $headers `
  -ContentType: "application/json; charset=utf-8" `
  -InFile request.json `
  -Uri "https://translation.googleapis.com/v3/projects/[HIDDEN]/locations/us-east1/glossaries
" | Select-Object -Expand Content

这是基于他们的示例的request.json文件内容

{
  "name":"projects/[HIDDEN]/locations/us-east1/glossaries/kittglossary","languagePair": {
    "sourceLanguageCode": "en","targetLanguageCode": "hu"
    },"inputConfig": {
    "gcsSource": {
      "inputUri": "gs://kittgloss/glossary.csv"
    }
  }
}

我得到此错误返回:

Invoke-WebRequest : The Remote Server returned an error: (400) Bad Request.
At line:4 char:1
+ Invoke-WebRequest `
+ ~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : InvalidOperation: (System.Net.HttpWebRequest:HttpWebRequest) [Invoke-WebRequest],WebExc
   eption
    + FullyQualifiedErrorId : WebCmdletWebResponseException,Microsoft.PowerShell.Commands.InvokeWebRequestCommand

我确实具有GOOGLE_APPLICATION_CREDENTIALS环境变量,并且在尝试测试翻译之前,隐式身份验证已经起作用


尝试使用示例python代码创建词汇表后:

from google.cloud import translate_v3 as translate

pid = "[HIDDEN]",iuri = "gs://kittgloss/glossary.csv",gid = "kittglossary",def create_glossary(
        project_id,input_uri,glossary_id,timeout,):
    """
    Create a equivalent term sets glossary. Glossary can be words or
    short phrases (usually fewer than five words).
    https://cloud.google.com/translate/docs/advanced/glossary#format-glossary
    """
    client = translate.TranslationServiceClient()

    # Supported language codes: https://cloud.google.com/translate/docs/languages
    source_lang_code = "en"
    target_lang_code = "hu"
    location = "us-east1"  # The location of the glossary

    name = client.glossary_path(project_id,location,glossary_id)
    language_codes_set = translate.types.Glossary.LanguageCodesSet(
        language_codes=[source_lang_code,target_lang_code]
    )

    gcs_source = translate.types.GcsSource(input_uri=input_uri)

    input_config = translate.types.GlossaryInputConfig(gcs_source=gcs_source)

    glossary = translate.types.Glossary(
        name=name,language_codes_set=language_codes_set,input_config=input_config
    )

    parent = client.location_path(project_id,location)
    # glossary is a custom dictionary Translation API uses
    # to translate the domain-specific terminology.
    operation = client.create_glossary(parent=parent,glossary=glossary)

    result = operation.result(timeout)
    print("Created: {}".format(result.name))
    print("Input Uri: {}".format(result.input_config.gcs_source.input_uri))


create_glossary(pid,iuri,gid,timeout=180)

我收到以下错误,抱怨文件元组而不是str:

Traceback (most recent call last):
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",line 702,in field_setter
    new_value = type_checker.CheckValue(new_value)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\type_checkers.py",line 215,in CheckValue
    raise TypeError(message)
TypeError: ('gs://kittgloss/glossary.csv',) has type <class 'tuple'>,but expected one of: (<class 'bytes'>,<class 'str'>)

During handling of the above exception,another exception occurred:

Traceback (most recent call last):
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",line 558,in init
    setattr(self,field_name,field_value)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",line 704,in field_setter
    raise TypeError(
TypeError: Cannot set google.cloud.translation.v3.GcsSource.input_uri to ('gs://kittgloss/glossary.csv',): ('gs://kittgloss/glossary.csv',another exception occurred:

Traceback (most recent call last):
  File "C:\py\glossary.py",line 48,in <module>
    create_glossary(pid,timeout=180)
  File "C:\py\glossary.py",line 30,in create_glossary
    gcs_source = translate.types.GcsSource(input_uri=input_uri)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\proto\message.py",line 421,in __init__
    self.__dict__["_pb"] = self._Meta.pb(**params)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",line 560,in init
    _ReraiseTypeErrorWithFieldName(message_descriptor.name,field_name)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",line 477,in _ReraiseTypeErrorWithFieldName
    six.reraise(type(exc),exc,sys.exc_info()[2])
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\six.py",in reraise
    raise value.with_traceback(tb)
  File "C:\Users\pc\AppData\Local\Programs\Python\python38-32\lib\site-packages\google\protobuf\internal\python_message.py",<class 'str'>) for field GcsSource.input_uri

词汇表文件非常简单,前几行看起来像这样:

rear bumpers,hátsó lökhárító
front bumper spoiler,első lökhárító spoiler
front bumpers,első lökhárító

我将不胜感激。

解决方法

我通过将自己的词汇表转换为一个简单的EN到HU等效术语表来解决自己的问题,方法是在我的csv文件中添加标头,如下所示:

前几行

en,hu,pos
rear bumpers,hátsó lökhárító,noun
front bumper spoiler,első lökhárító spoiler,noun
front bumpers,első lökhárító,noun

然后,我使用示例python代码进行了一些修改以创建词汇表。我遇到的一个问题是,词汇表显然只能在us-central1和global中创建。我知道我的代码仅通过使用字符串就不会看起来很漂亮,但它确实有效:

from google.cloud import translate_v3beta1 as translate
def create_glossary():

    client = translate.TranslationServiceClient()
    ## Set your project name
    project_id = 'flawless-acre-284812'
    ## Set your wished glossary-id
    glossary_id = 'kittglossaryv2'
    ## Set your location
    location = 'us-central1'  # The location of the glossary

    name = client.glossary_path(
        project_id,location,glossary_id)

    language_codes_set = translate.types.Glossary.LanguageCodesSet(
        language_codes=['en','hu'])
    ## SET YOUR BUCKET URI
    gcs_source = translate.types.GcsSource(
        input_uri='gs://kittgloss/etglossaryv2.csv')

    input_config = translate.types.GlossaryInputConfig(
        gcs_source=gcs_source)

    glossary = translate.types.Glossary(
        name=name,language_codes_set=language_codes_set,input_config=input_config)

    parent = 'projects/flawless-acre-284812/locations/us-central1'

    operation = client.create_glossary(parent=parent,glossary=glossary)

    result = operation.result(timeout=90)
    print('Created: {}'.format(result.name))
    print('Input Uri: {}'.format(result.input_config.gcs_source.input_uri))
    
create_glossary()

希望这对某人有帮助

相关问答

Selenium Web驱动程序和Java。元素在(x,y)点处不可单击。其...
Python-如何使用点“。” 访问字典成员?
Java 字符串是不可变的。到底是什么意思?
Java中的“ final”关键字如何工作?(我仍然可以修改对象。...
“loop:”在Java代码中。这是什么,为什么要编译?
java.lang.ClassNotFoundException:sun.jdbc.odbc.JdbcOdbc...