ios - How to perform inter-frame video compression with AVFoundation

Question

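In an application I'm building, I've created a process that generates video "slideshows" from collections of photos and images. The process works correctly, but it produces unnecessarily large files, because each photo in the video is repeated unchanged for 100 to 150 frames. I've applied every compression option I could find in AVFoundation, which are mostly intra-frame techniques, and I've tried to find more information about inter-frame compression in AVFoundation. Unfortunately, I've found only a handful of references, and none of them got me to a working solution.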

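I'm hoping someone can point me in the right direction. The code for the video generator is shown below. I haven't included the code that fetches and prepares the individual frames (called as self.getFrame() below), because it appears to work fine and is quite complex: it handles photos, videos, adding title frames, fade transitions, and so on. For repeated frames, it returns a struct containing the frame image and a counter for the number of output frames to include.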

        // Create a new AVAssetWriter Instance that will build the video

        assetWriter = createAssetWriter(path: filePathNew, size: videoSize!)
        guard assetWriter != nil else
        {
            print("Error converting images to video: AVAssetWriter not created.")
            inProcess = false
            return
        }

        let writerInput = assetWriter!.inputs.filter{ $0.mediaType == AVMediaTypeVideo }.first!

        let sourceBufferAttributes : [String : AnyObject] = [
            kCVPixelBufferPixelFormatTypeKey as String : Int(kCVPixelFormatType_32ARGB) as AnyObject,
            kCVPixelBufferWidthKey as String : videoSize!.width as AnyObject,
            kCVPixelBufferHeightKey as String : videoSize!.height as AnyObject,
            AVVideoMaxKeyFrameIntervalKey as String : 50 as AnyObject,
            AVVideoCompressionPropertiesKey as String : [
                AVVideoAverageBitRateKey: 725000,
                AVVideoProfileLevelKey: AVVideoProfileLevelH264Baseline30,
                ] as AnyObject
        ]

        let pixelBufferAdaptor = AVAssetWriterInputPixelBufferAdaptor(assetWriterInput: writerInput, sourcePixelBufferAttributes: sourceBufferAttributes)

        // Start the writing session

        assetWriter!.startWriting()

        assetWriter!.startSession(atSourceTime: kCMTimeZero)

        if (pixelBufferAdaptor.pixelBufferPool == nil) {
            print("Error converting images to video: pixelBufferPool nil after starting session")
            inProcess = false
            return
        }

        // -- Create queue for <requestMediaDataWhenReadyOnQueue>

        let mediaQueue = DispatchQueue(label: "mediaInputQueue")

        // Initialize run time values

        var presentationTime = kCMTimeZero
        var done = false
        var nextFrame: FramePack?                // The FramePack struct has the frame to output, noDisplays - the number of times that it will be output
                                                 // and an isLast flag that is true when it's the final frame

        writerInput.requestMediaDataWhenReady(on: mediaQueue, using: { () -> Void in    // Keeps invoking the block to get input until call markAsFinished

            nextFrame = self.getFrame()          // Get the next frame to be added to the output with its associated values
            let imageCGOut = nextFrame!.frame    // The frame to output
            if nextFrame!.isLast { done = true } // Identifies the last frame so can drop through to markAsFinished() below

            var frames = 0                       // Counts how often we've output this frame
            var waitCount = 0                    // Used to avoid an infinite loop if there's trouble with writer.Input

            while (frames < nextFrame!.noDisplays) && (waitCount < 1000000)  // Need to wait for writerInput to be ready - count deals with potential hung writer
            {
                waitCount += 1
                if waitCount == 1000000     // Have seen it go into 100s of thousands and succeed
                {
                    print("Exceeded waitCount limit while attempting to output slideshow frame.")
                    self.inProcess = false
                    return
                }

                if (writerInput.isReadyForMoreMediaData)
                {
                    waitCount = 0
                    frames += 1

                    autoreleasepool
                        {
                            if  let pixelBufferPool = pixelBufferAdaptor.pixelBufferPool
                            {
                                let pixelBufferPointer = UnsafeMutablePointer<CVPixelBuffer?>.allocate(capacity: 1)
                                let status: CVReturn = CVPixelBufferPoolCreatePixelBuffer(
                                    kCFAllocatorDefault,
                                    pixelBufferPool,
                                    pixelBufferPointer
                                )

                                if let pixelBuffer = pixelBufferPointer.pointee, status == 0
                                {
                                    CVPixelBufferLockBaseAddress(pixelBuffer, CVPixelBufferLockFlags(rawValue: CVOptionFlags(0)))
                                    let pixelData = CVPixelBufferGetBaseAddress(pixelBuffer)
                                    let rgbColorSpace = CGColorSpaceCreateDeviceRGB()

                                    // Set up a context for rendering using the PixelBuffer allocated above as the target

                                    let context = CGContext(
                                        data: pixelData,
                                        width: Int(self.videoWidth),
                                        height: Int(self.videoHeight),
                                        bitsPerComponent: 8,
                                        bytesPerRow: CVPixelBufferGetBytesPerRow(pixelBuffer),
                                        space: rgbColorSpace,
                                        bitmapInfo: CGImageAlphaInfo.premultipliedFirst.rawValue
                                    )

                                    // Draw the image into the PixelBuffer used for the context

                                    context?.draw(imageCGOut, in: CGRect(x: 0.0,y: 0.0,width: 1280, height: 720))

                                    // Append the image (frame) from the context pixelBuffer onto the video file

                                    _ = pixelBufferAdaptor.append(pixelBuffer, withPresentationTime: presentationTime)
                                    presentationTime = presentationTime + CMTimeMake(1, videoFPS)

                                    // We're done with the PixelBuffer, so unlock it

                                    CVPixelBufferUnlockBaseAddress(pixelBuffer, CVPixelBufferLockFlags(rawValue: CVOptionFlags(0)))
                                }

                                pixelBufferPointer.deinitialize()
                                pixelBufferPointer.deallocate(capacity: 1)

                            } else {
                                NSLog("Error: Failed to allocate pixel buffer from pool")
                            }
                    }
                }
            }

Thanks in advance for any suggestions.

score 2 · Accepted Answer

It looks like you are (1) appending a lot of redundant frames to your video, and (2) laboring under the misconception that a video file must have a constant, high frame rate (e.g. 30 fps).

For example, if you're showing a slideshow of 3 images over 15 seconds, you only need to output 3 images, with presentation timestamps of 0 s, 5 s, and 10 s, and an assetWriter.endSession(atSourceTime:) of 15 s, not 15 s * 30 FPS = 450 frames.
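
As a rough illustration of that idea, here is a minimal sketch that appends each image once and lets the timestamps decide how long it stays on screen. The Slide struct and the writeSparseSlides function are hypothetical names, not part of the question's code, and the sketch uses current Swift API spellings (for example CMTime(seconds:preferredTimescale:)) rather than the Swift 3 names in the question. It assumes the writer has already been started with startWriting() and startSession(atSourceTime:).

```swift
import AVFoundation

// Hypothetical container (not from the question): one entry per distinct image.
struct Slide {
    let pixelBuffer: CVPixelBuffer   // already rendered, e.g. with the CGContext code in the question
    let startSeconds: Double         // when this image should first appear
}

// Appends every slide exactly once; the presentation timestamps define the durations.
// Assumes writer.startWriting() and writer.startSession(atSourceTime: .zero) were already called.
// Returns false (without calling completion) if the writer stops accepting data.
func writeSparseSlides(_ slides: [Slide],
                       totalSeconds: Double,
                       writer: AVAssetWriter,
                       input: AVAssetWriterInput,
                       adaptor: AVAssetWriterInputPixelBufferAdaptor,
                       completion: @escaping @Sendable () -> Void) -> Bool {
    let timescale: CMTimeScale = 600

    for slide in slides {
        // Simple polling for brevity; give up if the writer has left the .writing state.
        while !input.isReadyForMoreMediaData {
            guard writer.status == .writing else { return false }
            Thread.sleep(forTimeInterval: 0.01)
        }
        let time = CMTime(seconds: slide.startSeconds, preferredTimescale: timescale)
        guard adaptor.append(slide.pixelBuffer, withPresentationTime: time) else { return false }
    }

    input.markAsFinished()
    // The session end time is what gives the last image its on-screen duration.
    writer.endSession(atSourceTime: CMTime(seconds: totalSeconds, preferredTimescale: timescale))
    writer.finishWriting(completionHandler: completion)
    return true
}
```

For the 15-second example, this would be called with three slides whose startSeconds are 0, 5, and 10, and with totalSeconds set to 15.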

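In other words, your frame rate is far too high. For the best inter-frame compression money can buy, lower your frame rate to the bare minimum number of frames you need, and all will be well*.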

* I've seen some video services/players choke on unusually low frame rates, so you may need a minimum frame rate and some redundant frames (e.g. 1 frame per 5 s; YMMV).
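
One way to picture that trade-off is to compute the list of presentation times up front. The sketch below is plain Swift with no AVFoundation dependency; frameSchedule and its maxGapSeconds parameter are hypothetical names introduced only for illustration. Each image is emitted at its start time and then repeated only as often as needed so that no two consecutive frames are more than maxGapSeconds apart.

```swift
// Hypothetical helper: returns (slide index, presentation time in seconds) pairs.
// Each slide is emitted at its start time, then repeated every maxGapSeconds
// until the next slide starts (or the video ends).
func frameSchedule(slideStarts: [Double],
                   totalSeconds: Double,
                   maxGapSeconds: Double) -> [(slide: Int, time: Double)] {
    precondition(maxGapSeconds > 0, "maxGapSeconds must be positive")
    var schedule: [(slide: Int, time: Double)] = []
    for (index, start) in slideStarts.enumerated() {
        // A slide lasts until the next slide starts, or until the end of the video.
        let end = index + 1 < slideStarts.count ? slideStarts[index + 1] : totalSeconds
        var time = start
        while time < end {
            schedule.append((slide: index, time: time))
            time += maxGapSeconds
        }
    }
    return schedule
}

// The example from the answer: 3 images over 15 seconds.
let constantRateFrames = 15 * 30                                   // 450 frames at 30 FPS
let sparse = frameSchedule(slideStarts: [0, 5, 10], totalSeconds: 15, maxGapSeconds: 5)
let padded = frameSchedule(slideStarts: [0, 5, 10], totalSeconds: 15, maxGapSeconds: 2)
print(constantRateFrames, sparse.count, padded.count)              // 450 3 9
```

Each resulting time can then be converted to a CMTime and passed as the presentation time when appending the corresponding pixel buffer.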
