spring-mvc - java.util.zip.ZipEntry および apache 圧縮 ZipEntry により一部のイメージファイルが破損する

Question

Spring のマルチパートアップローダコントローラを使用して、zip ファイルからエントリをアップロードおよび保存していますが、時折 PNG ファイルが破損していることに気付きました。「fþ»ÀígÞÉ」などで。これは、実行ごとに同じファイルに発生するようです。java.util.ZipEntry を使用してこれらすべてを試した後、Apache Compress を試したところ、Apache Compress が Java 7 ユーティリティとは異なるファイルを破損していることがわかりましたが、その後の実行では常に同じファイルでした。

コード (最初に java.util.zip.ZipEntry):

protected void processZipFile(String path, MultipartFile file, String signature) throws IOException {

    DateFormat df = new SimpleDateFormat("yyyyMMddhhmmss");
    File tempFile = new File(System.getProperty("user.dir") + "/" + file.getName() + df.format(new Date()));
    file.transferTo(tempFile);

    ZipFile zipFile = null;
    try {
        zipFile = new ZipFile(tempFile);

        LOG.debug("Processing archive with name={}, size={}.", file.getName(), file.getSize());

        final Enumeration<? extends ZipEntry> entries = zipFile.entries();
        while ( entries.hasMoreElements() )
        {
            ZipEntry entry = entries.nextElement();

            LOG.debug("Processing file={} is directory?={}.", entry.getName(), entry.isDirectory());

            // we don't bother processing directories, and we don't process any resource fork info
            // from Mac OS X (which does not seem to be transparent to ZipFile).

            if (!(entry.isDirectory() || entry.getName().contains("__MACOSX")  || entry.getName().contains(".DS_Store"))) {
                // if the entry is a file, extract it


                Content contentToSave = null;

                if(entry.getName().contains("gif") || entry.getName().contains("png") || entry.getName().contains("jpeg")) {

                    byte[] bytes = readInputStream( zipFile.getInputStream( entry ), entry.getSize() );

                    LOG.debug("{} is of inflated-length={} from compressed-length={}",
                            entry.getName(), bytes.length, entry.getCompressedSize());

                    if(entry.getName().contains("gif")) {
                        contentToSave = Content.makeImage(path + entry.getName(), Content.GIF, signature, bytes);

                    } else if (entry.getName().contains("png")) {
                        contentToSave = Content.makeImage(path + entry.getName(), Content.PNG, signature, bytes);

                    } else if (entry.getName().contains("jpeg")) {
                        contentToSave = Content.makeImage(path + entry.getName(), Content.JPEG, signature, bytes);

                    }
                } else {

                    InputStream is = zipFile.getInputStream(entry);

                    if (entry.getName().contains("json")) {
                        contentToSave = Content.makeFile(path + entry.getName(), Content.JSON, signature, convertStreamToString(is));
                    } else if (entry.getName().contains("js")) {
                        contentToSave = Content.makeFile(path + entry.getName(), Content.JS, signature, convertStreamToString(is));
                    } else if (entry.getName().contains("css")) {
                        contentToSave = Content.makeFile(path + entry.getName(), Content.CSS, signature, convertStreamToString(is));
                    } else if (entry.getName().contains("xml")) {
                        contentToSave = Content.makeFile(path + entry.getName(), Content.XML, signature, convertStreamToString(is));
                    } else if (entry.getName().contains("html")) {
                        contentToSave = Content.makeFile(path + entry.getName(), Content.HTML, signature, convertStreamToString(is));
                    }
                }

                contentService.putOrReplace(contentToSave);
                LOG.info("Persisted file: {} from uploaded version.", contentToSave.getName());
            }

        }
    } catch (ZipException e) {
        // If I can't create a ZipFile, then this is not a zip file at all and it cannot be processed
        // by this method. Its pretty dumb that there's no way to determine whether the contents are zipped through
        // the ZipFile API, but that's just one of its many problems.
        e.printStackTrace();
        LOG.error("{} is not a zipped file, or it is empty", file.getName());
    } finally {
        zipFile = null;
    }
    tempFile.delete();

}

org.apache.commons.compress.archivers.zip.ZipFile についても同じことが言えます。

protected void processZipFile(String path, MultipartFile file, String signature) throws IOException {

    DateFormat df = new SimpleDateFormat("yyyyMMddhhmmss");
    File tempFile = new File(System.getProperty("user.dir") + "/" + file.getName() + df.format(new Date()));
    file.transferTo(tempFile);

    ZipFile zipFile = null;
    try {
        zipFile = new ZipFile(tempFile);

        LOG.debug("Processing archive with name={}, size={}.", file.getName(), file.getSize());

        final Enumeration<? extends ZipArchiveEntry> entries = zipFile.getEntries();
        while ( entries.hasMoreElements() ) {
            ZipArchiveEntry entry = entries.nextElement();

            LOG.debug("Processing file={} is directory?={}.", entry.getName(), entry.isDirectory());

                // we don't bother processing directories, and we don't process any resource fork info
                // from Mac OS X (which does not seem to be transparent to ZipFile).

                if (!(entry.isDirectory() || entry.getName().contains("__MACOSX")  || entry.getName().contains(".DS_Store"))) {
                    // if the entry is a file, extract it

                    Content contentToSave = null;

                    if(entry.getName().contains("gif") || entry.getName().contains("png") || entry.getName().contains("jpeg")) {

                        byte[] bytes = readInputStream( zipFile.getInputStream( entry ), entry.getSize() );

                        LOG.debug("{} is of inflated-length={} from compressed-length={}",
                            entry.getName(), bytes.length, entry.getCompressedSize());

                        if(entry.getName().contains("gif")) {
                            contentToSave = Content.makeImage(path + entry.getName(), Content.GIF, signature, bytes);

                        } else if (entry.getName().contains("png")) {
                            contentToSave = Content.makeImage(path + entry.getName(), Content.PNG, signature, bytes);

                        } else if (entry.getName().contains("jpeg")) {
                            contentToSave = Content.makeImage(path + entry.getName(), Content.JPEG, signature, bytes);

                        }
                    } else {

                        InputStream is = zipFile.getInputStream(entry);

                        if (entry.getName().contains("json")) {
                            contentToSave = Content.makeFile(path + entry.getName(), Content.JSON, signature, convertStreamToString(is));
                        } else if (entry.getName().contains("js")) {
                            contentToSave = Content.makeFile(path + entry.getName(), Content.JS, signature, convertStreamToString(is));
                        } else if (entry.getName().contains("css")) {
                            contentToSave = Content.makeFile(path + entry.getName(), Content.CSS, signature, convertStreamToString(is));
                        } else if (entry.getName().contains("xml")) {
                            contentToSave = Content.makeFile(path + entry.getName(), Content.XML, signature, convertStreamToString(is));
                        } else if (entry.getName().contains("html")) {
                            contentToSave = Content.makeFile(path + entry.getName(), Content.HTML, signature, convertStreamToString(is));
                        }
                    }

                    contentService.putOrReplace(contentToSave);
                    LOG.info("Persisted file: {} from uploaded version.", contentToSave.getName());
                }

        }
    } catch (ZipException e) {
        e.printStackTrace();
        LOG.error("{} is not a zipped file, or it is empty", file.getName());
    } catch (IOException e) {
        e.printStackTrace();
        LOG.error("{} is not a file, or it is empty", file.getName());
    } finally {
        zipFile = null;
    }
    tempFile.delete();
}

呼び出される 2 つのメソッドは次のとおりです。

private static byte[] readInputStream( final InputStream is, final long length ) throws IOException {
    final byte[] buf = new byte[ (int) length ];
    int read = 0;
    int cntRead;
    while ( ( cntRead = is.read( buf, 0, buf.length ) ) >=0  )
    {
        read += cntRead;
    }
    return buf;
}

と：

public String convertStreamToString(InputStream is) throws IOException {
    StringBuilder sb = new StringBuilder(2048);
    char[] read = new char[128];
    try (InputStreamReader ir = new InputStreamReader(is, StandardCharsets.UTF_8)) {
        for (int i; -1 != (i = ir.read(read)); sb.append(read, 0, i));
    }

    // need to remove the ? at teh beginning of some files. This comes from the UTF8 BOM
    // that is added to some files saved as UTF8
    String out = sb.toString();
    String utf8Bom = new String(new char[]{'\ufeff'});
    if(out.contains(utf8Bom)) {
        out = out.replace(utf8Bom,"");
    }
    return out;
}

もちろん、2 つ目はおそらく問題の一部ではありません。

私はググってみましたが、これに似た問題が見つかったようですが、それは常に外部の問題でした. なぜこれが当てはまるのか誰にもわかりますか？

いくつかの画像を再編集したところ、画像を白黒に変更したり、画像全体の色相を変更したりすると問題はなくなりますが、境界線を追加したり単一の色を変更したりすると問題が残ります。一部のファイルのバイトの特定の配置は、Java 自身と Apache の圧縮ファイルリーダーの両方が使用する基礎となる API のバグをくすぐるように見えますが、それは単なる憶測です。

EDIT：追加の使用法は、サイズが10Kを超えるgifで破損が発生することを示しているため、おそらくこれはバグと関係がありますか？ReadInputStream() の呼び出しでバッファのサイズを任意に 2 倍にしようとしましたが、MySQL で特に大きな画像 (49K が 98K になり、大きすぎた) で blob サイズがオーバーフローする以外は何もしませんでした。

com.mysql.jdbc.MysqlDataTruncation: Data truncation: Data too long for column 'encoded_content' at row 1

score 0 · Accepted Answer

私の発見は、「パックされたサイズ」が実際のサイズよりも大きい場合にこの問題が発生することです。これは、たとえば、既に「圧縮」されている png ファイルで発生する可能性があります。

spring-mvc - java.util.zip.ZipEntry および apache 圧縮 ZipEntry により一部のイメージ ファイルが破損する

1 に答える 1

Related

Reference

spring-mvc - java.util.zip.ZipEntry および apache 圧縮 ZipEntry により一部のイメージファイルが破損する