使用python脚本生成xml文件时出现缩进错误

问题描述

我正在尝试通过读取Excel工作表来使用python脚本创建XML文件。使用yattag我可以完成此操作,尽管不是我需要的格式。 我已经粘贴了下面的代码,并且已经确认没有空格/制表符的混合。

目标是将整个项目包装在“ node”标签中,并为两个“ category”标签增加2个子类别。我收到错误消息是因为在“节点”标签之后,在“位置”标签之前有2个标签。如果纠正错误,我将获得第一组代码。基本上,只要有必要,就只需将''下拉至底部

<node type="document" action="create">
        <location>TempCD</location>
        <title>doc1</title>
        <file>E:\Doc1.docx</file>
        <mime>application</mime>
    </node>
    <category name="Content">
        <attribute name="Function">asd</attribute>
        <attribute name="commodity">sf</attribute>
        <attribute name="Sub-commodity">qw</attribute>
        <attribute name="Contract/Document Owner">e</attribute>
        <subitems>reapply</subitems>
    </category>
    <category name="Content Server Categories:LYB:LYB-GSC-Contracts">
        <attribute name="supplier">Altom Transport</attribute>
        <attribute name="Pricing Terms">Fixed</attribute>
        <attribute name="Term Type">Fixed</attribute>
        <subitems name="commodity">reapply</subitems>
    </category>
     from openpyxl import load_workbook
        from yattag import Doc,indent
        
        wb = load_workbook("input_sample.xlsx")
        ws = wb.worksheets[0]
        
        # Create Yattag doc,tag and text objects
        doc,tag,text = Doc().tagtext()
        
        xml_header = '<?xml version="1.0" encoding="UTF-8"?>'
        xml_schema = '<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"></xs:schema>'
        
        doc.asis(xml_header)
        doc.asis(xml_schema)
        
        for row in ws.iter_rows(min_row=2):
            row = [cell.value for cell in row]
            with tag('node',type=row[0],action=row[1]):
                    with tag("location"): text(row[2])
                    with tag("title"): text(row[3])
                    with tag("file"): text(row[4])
                    with tag("mime"): text(row[5])
                with tag('category',name=row[6]):
                    with tag("attribute",name='Function'): text(row[7])
                    with tag("attribute",name='commodity'): text(row[8])
                    with tag("attribute",name='Sub-commodity'): text(row[9])
                    with tag("attribute",name='Contract/Document Owner'): text(row[10])
                    with tag("subitems"): text("reapply")
                with tag('category',name=row[11]):
                    with tag("attribute",name='supplier'): text(row[12])
                    with tag("attribute",name='Pricing Terms'): text(row[13])
                    with tag("attribute",name='Term Type'): text(row[14])
                    with tag("subitems"): text("reapply")
        
        result = indent(
            doc.getvalue(),indentation = '    ',indent_text = False
        )
        
        with open("test_resulted.xml","w") as f:
            f.write(result)

解决方法

这应该为您提供所需的xml:

from openpyxl import load_workbook
from yattag import Doc,indent

wb = load_workbook("input_sample.xlsx")
ws = wb.worksheets[0]

# Create Yattag doc,tag and text objects
doc,tag,text = Doc().tagtext()

xml_header = '<?xml version="1.0" encoding="UTF-8"?>'
xml_schema = '<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"></xs:schema>'

doc.asis(xml_header)
#doc.asis(xml_schema)  # invalid

with tag('root'):  # required for valid xml
    for row in ws.iter_rows(min_row=2):
        row = [cell.value for cell in row]
        with tag('node',type=row[0],action=row[1]):
                with tag("location"): text(row[2])
                with tag("title"): text(row[3])
                with tag("file"): text(row[4])
                with tag("mime"): text(row[5])
                with tag('category',name=row[6]):
                    with tag("attribute",name='Function'): text(row[7])
                    with tag("attribute",name='Commodity'): text(row[8])
                    with tag("attribute",name='Sub-Commodity'): text(row[9])
                    with tag("attribute",name='Contract/Document Owner'): text(row[10])
                    with tag("subitems"): text("reapply")
                with tag('category',name=row[11]):
                    with tag("attribute",name='Supplier'): text(row[12])
                    with tag("attribute",name='Pricing Terms'): text(row[13])
                    with tag("attribute",name='Term Type'): text(row[14])
                    with tag("subitems"): text("reapply")
                

result = indent(
doc.getvalue(),indentation = '    ',indent_text = False
)

with open("test_resulted.xml","w") as f:
   f.write(result)

输出

<?xml version="1.0" encoding="UTF-8"?>
<root>
    <node type="2" action="2">
        <location>2</location>
        <title>2</title>
        <file>2</file>
        <mime>2</mime>
        <category name="2">
            <attribute name="Function">2</attribute>
            <attribute name="Commodity">2</attribute>
            <attribute name="Sub-Commodity">2</attribute>
            <attribute name="Contract/Document Owner">2</attribute>
            <subitems>reapply</subitems>
        </category>
        <category name="2">
            <attribute name="Supplier">2</attribute>
            <attribute name="Pricing Terms">2</attribute>
            <attribute name="Term Type">2</attribute>
            <subitems>reapply</subitems>
        </category>
    </node>
    <node>
       ..........
    </node>
    ..............
</root>