读取zlib / miniz压缩数据时出现DATA_ERROR

问题描述

我正在为miniz-cpp的zlib压缩实现编写一个简单的C ++包装。通货紧缩可以正常工作,但是现在我又无法再次填充数据了。

代码

我有一个测试用例(大致简化了)归结为:

ByteArray randomData = createrandomData(1024 * 1024);
ByteArray deflatedBytes = deflate(randomData);
writetoTmpFile(deflatedBytes); // for manual review
ByteArray inflatedBytes = inflate(deflatedBytes);

assert(randomData == inflatedBytes);

我被卡在DATA_ERROR (-3)上,当我再次为数据充气时会返回该// inflates the next <size> bytes and stores them in <out[]> // stores the actually written amount in <written> ResultCode Inflator::inflate(uint8_t out[],size_t size,size_t& written) { zStream.next_out = out; zStream.avail_out = static_cast<unsigned int>(size); // loop until output buffer is completely filled while (zStream.avail_out != 0) { if (zStream.avail_in == 0) { // our Inflator stores a ByteArrayInputStream from which // we request more data size_t read = iStream.read(in,BUFFER_SIZE); if (iStream.err()) { return ResultCode::STREAM_ERROR; } zStream.next_in = in; zStream.avail_in = static_cast<unsigned int>(read); } // THIS IS WHERE WE ACTUALLY CALL INFLATE. // RESULT CODE -3 (DATA_ERROR) IS RETURNED AFTER READING // ONLY 13 BYTES. ResultCode result{mz_inflate(&zStream,Flushing::NONE)}; if (result == ResultCode::STREAM_END) { written = size - zStream.avail_out; this->eof_ = true; return ResultCode::OK; } else if (result != ResultCode::OK) { return result; } } written = size - zStream.avail_out; return ResultCode::OK; } 。 这是发生问题的函数

zStream

我的数据

我已经在调试器中验证了我读取的数据是正确的:

enter image description here

您可以看到mz_stream的{​​{1}}在next_in中具有有效的zlib编码数据。至少它以0x78开头。 正如我在伪代码中提到的,我还将数据转储到磁盘。使用以下命令可以很好地读取此数据:

# this command is included in the qpdf package and uncompresses zlib streams
zlib-flate -uncompress < 'mve_deflOutput.zlib' > 'mve_deflOutput.bin'

这也是前几个字节的十六进制转储:

00000000: 7801 a4dd fb7f cfe5 1b07 7072 c8a9 9632  x.........pr...2
00000010: 49ac 9043 9a4c 392c 462c 4d88 8ab0 4a7c  I..C.L9,F,M...J|
00000020: 55d6 2c6d 6921 0931 34ad 4db5 8898 1c26  U.,mi!.14.M....&
00000030: 3a88 ce69 51d9 6a52 94a8 302d d252 6ba6  :..iQ.jR..0-.Rk.
00000040: 84a2 b2ef 9ff0 fce1 be7f dd63 dbe7 f37e  ...........c...~
00000050: dff7 75bd aed7 eb75 5df7 11ac 4358 3763  ..u....u]...CX7c
00000060: b5c0 ea88 550f ab33 d62e aca7 b132 b116  ....U..3.....2..
00000070: 611d c73a 8935 096b 0d56 05d6 8758 a3b1  a..:.5.k.V...X..
00000080: f0ef d7b4 c5c2 bfff f027 2c3c be93 b5b1  .........',<....
00000090: cec2 1a80 7531 5612 d68d 583b b0d6 622d  ....u1V...X;..b-

错误

无论出于何种原因,对mz_inflate的模仿zlib的inflate调用都会返回DATA_ERROR (-3)total_in中的zStream字段设置为13,因此看起来在错误发生之前只读取了13个字节。

总结一下:如果放气后的数据正常,可以使用zlib-flate进行提取,那么为什么不能最小化读取此数据?它是按字面意思写的。如果前13个字节有问题,我看不出可能是什么。

For reference,here is the full code of the Inflator and the test.

解决方法

这是一个防反跳解决方案,但事实证明,我应该使用实际的miniz而不是miniz-cpp的单头镜像。这个单头库使用的是自2017年以来库的严重过时版本,根本无法正确读取数据。

使用实际的$ cat Dockerfile FROM node:12 RUN apt-get update && apt-get install -y libasound2 libxtst6 RUN wget https://download.bell-sw.com/java/11.0.7+10/bellsoft-jdk11.0.7+10-linux-amd64.deb && \ apt install ./bellsoft-jdk11.0.7+10-linux-amd64.deb $ docker build --tag mynode:1.1 . $ docker run -it mynode:1.1 /bin/bash root@37771ce98727:/# java -version openjdk version "11.0.7" 2020-04-14 LTS OpenJDK Runtime Environment (build 11.0.7+10-LTS) OpenJDK 64-Bit Server VM (build 11.0.7+10-LTS,mixed mode) root@37771ce98727:/# javac -version javac 11.0.7 时,测试通过了,一切正常。我的代码是100%正确的,只是使用了错误的库。