如何使用 Python 从合并的 Word 文档文件中删除重复文本?

问题描述

正如问题所述,我有一个使用 aspose 创建的合并 Word 文档文件代码如下:

import os
import asposewordscloud
import asposewordscloud.models.requests
from shutil import copyfile


# Please get your Client ID and Secret from https://dashboard.aspose.cloud.
client_id='my_id_#'
client_secret='my_secret_#'

words_api = asposewordscloud.WordsApi(client_id,client_secret)
words_api.api_client.configuration.host='https://api.aspose.cloud'


remoteFolder = 'Documents/'
localFolder = '/mnt/c/Users/%user%/Documents'
localFileName = 'new_merged_doc.docx'
remoteFileName = 'new_merged_doc.docx'
localFileName1 = 'rainer_docs.docx'
remoteFileName1 = 'rainer_docs.docx'

#upload file
words_api.upload_file(asposewordscloud.models.requests.UploadFileRequest(open(localFolder + '/' + localFileName,'rb'),remoteFolder + '/' + remoteFileName))
words_api.upload_file(asposewordscloud.models.requests.UploadFileRequest(open(localFolder + '/' + localFileName1,remoteFolder + '/' + remoteFileName1))

#append Word documents
requestDocumentListDocumentEntries0 = asposewordscloud.DocumentEntry(href=remoteFolder + '/' + remoteFileName1,import_format_mode='KeepSourceFormatting')

requestDocumentListDocumentEntries = [requestDocumentListDocumentEntries0]
requestDocumentList = asposewordscloud.DocumentEntryList(document_entries=requestDocumentListDocumentEntries)
request = asposewordscloud.models.requests.AppendDocumentRequest(name=remoteFileName,document_list=requestDocumentList,folder=remoteFolder,dest_file_name= remoteFolder + '/' + remoteFileName)

result = words_api.append_document(request)

#download file
request_download=asposewordscloud.models.requests.DownloadFileRequest(remoteFolder + '/' + remoteFileName)
response_download = words_api.download_file(request_download)
copyfile(response_download,localFolder + '/' +"new_merged_doc2.docx")

源文档和目标文档具有完全相同的文本,只是样式不同。我指定我要保留源格式。但是,我不想要重复的文本。我希望目标文档中的文本被覆盖,以便只保留源文档的文本样式。

到目前为止,我在 Aspose 中一无所获,但我可能遗漏了一些东西。任何帮助将不胜感激。

解决方法

暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!

如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@)