
Raw H264 frames in mpegts container using libavcodec

I would really appreciate some help with the following issue:

I have a gadget with a camera producing H264-compressed video frames, and these frames are sent to my application. The frames are not in a container, just raw data.

I want to use ffmpeg and libav functions to create a video file which can be used later.

If I decode the frames and then encode them, everything works fine and I get a valid video file. (The decode/encode steps are the usual libav calls, nothing fancy; I took them from the almighty internet and they are rock solid.) However, I waste a lot of time decoding and encoding, so I would like to skip this step and put the frames directly into the output stream. Now the problems come.

Here is the code I came up with for producing the encoding:

AVFrame* picture = avcodec_alloc_frame(); // must be allocated before use

avpicture_fill((AVPicture*) picture, (uint8_t*)frameData, 
                 codecContext->pix_fmt, codecContext->width,
                 codecContext->height);
int outSize = avcodec_encode_video(codecContext, videoOutBuf, 
                 sizeof(videoOutBuf), picture);
if (outSize > 0) 
{
    AVPacket packet;
    av_init_packet(&packet);
    packet.pts = av_rescale_q(codecContext->coded_frame->pts,
                  codecContext->time_base, videoStream->time_base);
    if (codecContext->coded_frame->key_frame) 
    {
        packet.flags |= PKT_FLAG_KEY;
    }
    packet.stream_index = videoStream->index;
    packet.data =  videoOutBuf;
    packet.size =  outSize;

    av_interleaved_write_frame(context, &packet);
    put_flush_packet(context->pb);
}

Where the variables are:

frameData is the decoded frame data that came from the camera (it was decoded in a previous step), and videoOutBuf is a plain uint8_t buffer for holding the encoded data.

I have modified the application so that it does not decode the frames, but simply passes the data through, like this:

    AVPacket packet;
    av_init_packet(&packet);

    packet.stream_index = videoStream->index;
    packet.data = (uint8_t*)frameData;
    packet.size = currentFrameSize;

    av_interleaved_write_frame(context, &packet);
    put_flush_packet(context->pb);

where

frameData is the raw H264 frame and currentFrameSize is its size, i.e. the number of bytes I get from the gadget for every frame.

And suddenly the application is not working correctly anymore; the produced video is unplayable. This is obvious, since I was not setting a correct PTS for the packet. What I did was the following (I'm desperate, you can see it from this approach :) )

    packet.pts = timestamps[timestamp_counter ++];

where timestamps is actually a list of PTS values produced by the working code above and written to a file (yes, you read that right: I logged all the PTS values for a 10-minute session and wanted to reuse them).

The application still does not work.

Now here I am without any clue what to do, so here is the question:

I would like to create an "mpegts" video stream using libav functions, insert already-encoded video frames into the stream, and create a video file with it. How do I do it?

Thanks, f.

I believe if you set the following, you will see video playback.

packet.flags |= AV_PKT_FLAG_KEY;
packet.pts = packet.dts = 0;

You should really set packet.flags according to the H264 packet headers. You might try this fellow Stack Overflowian's suggestion for extracting that directly from the stream.

If you are also adding audio, then pts/dts is going to be more important. I suggest you study this tutorial.

EDIT

I found time to extract what is working for me from my test app. For some reason, dts/pts values of zero work for me, but values other than 0 or AV_NOPTS_VALUE do not. I wonder if we have different versions of ffmpeg; I have the latest from git://git.videolan.org/ffmpeg.git .

fftest.cpp

#include <string>

#ifndef INT64_C
#define INT64_C(c) (c ## LL)
#define UINT64_C(c) (c ## ULL)
#endif

//#define _M
#define _M printf( "%s(%d) : MARKER\n", __FILE__, __LINE__ )

extern "C"
{
    #include "libavcodec/avcodec.h"
    #include "libavformat/avformat.h"
};


AVFormatContext *fc = 0;
int vi = -1, waitkey = 1;

// < 0 = error
// 0 = I-Frame
// 1 = P-Frame
// 2 = B-Frame
// 3 = S-Frame
int getVopType( const void *p, int len )
{   
    if ( !p || 6 >= len )
        return -1;

    unsigned char *b = (unsigned char*)p;

    // Verify NAL marker
    if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
    {   b++;
        if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
            return -1;
    } // end if

    b += 3;

    // Verify VOP id
    if ( 0xb6 == *b )
    {   b++;
        return ( *b & 0xc0 ) >> 6;
    } // end if

    switch( *b )
    {   case 0x65 : return 0;
        case 0x61 : return 1;
        case 0x01 : return 2;
    } // end switch

    return -1;
}

void write_frame( const void* p, int len )
{
    if ( 0 > vi )
        return;

    AVStream *pst = fc->streams[ vi ];

    // Init packet
    AVPacket pkt;
    av_init_packet( &pkt );
    pkt.flags |= ( 0 >= getVopType( p, len ) ) ? AV_PKT_FLAG_KEY : 0;   
    pkt.stream_index = pst->index;
    pkt.data = (uint8_t*)p;
    pkt.size = len;

    // Wait for key frame
    if ( waitkey )
        if ( 0 == ( pkt.flags & AV_PKT_FLAG_KEY ) )
            return;
        else
            waitkey = 0;

    pkt.dts = AV_NOPTS_VALUE;
    pkt.pts = AV_NOPTS_VALUE;

//  av_write_frame( fc, &pkt );
    av_interleaved_write_frame( fc, &pkt );
}

void destroy()
{
    waitkey = 1;
    vi = -1;

    if ( !fc )
        return;

_M; av_write_trailer( fc );

    if ( fc->oformat && !( fc->oformat->flags & AVFMT_NOFILE ) && fc->pb )
        avio_close( fc->pb ); 

    // Free the stream
_M; av_free( fc );

    fc = 0;
_M; 
}

int get_nal_type( void *p, int len )
{
    if ( !p || 5 >= len )
        return -1;

    unsigned char *b = (unsigned char*)p;

    // Verify NAL marker
    if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
    {   b++;
        if ( b[ 0 ] || b[ 1 ] || 0x01 != b[ 2 ] )
            return -1;
    } // end if

    b += 3;

    return *b;
}

int create( void *p, int len )
{
    if ( 0x67 != get_nal_type( p, len ) )
        return -1;

    destroy();

    const char *file = "test.avi";
    CodecID codec_id = CODEC_ID_H264;
//  CodecID codec_id = CODEC_ID_MPEG4;
    int br = 1000000;
    int w = 480;
    int h = 354;
    int fps = 15;

    // Create container
_M; AVOutputFormat *of = av_guess_format( 0, file, 0 );
    fc = avformat_alloc_context();
    fc->oformat = of;
    strcpy( fc->filename, file );

    // Add video stream
_M; AVStream *pst = av_new_stream( fc, 0 );
    vi = pst->index;

    AVCodecContext *pcc = pst->codec;
_M; avcodec_get_context_defaults2( pcc, AVMEDIA_TYPE_VIDEO );
    pcc->codec_type = AVMEDIA_TYPE_VIDEO;

    pcc->codec_id = codec_id;
    pcc->bit_rate = br;
    pcc->width = w;
    pcc->height = h;
    pcc->time_base.num = 1;
    pcc->time_base.den = fps;

    // Init container
_M; av_set_parameters( fc, 0 );

    if ( !( fc->oformat->flags & AVFMT_NOFILE ) )
        avio_open( &fc->pb, fc->filename, URL_WRONLY );

_M; av_write_header( fc );

_M; return 1;
}

int main( int argc, char** argv )
{
    int f = 0, sz = 0;
    char fname[ 256 ] = { 0 };
    char buf[ 128 * 1024 ];

    av_log_set_level( AV_LOG_ERROR );
    av_register_all();

    do
    {
        // Raw frames in v0.raw, v1.raw, v2.raw, ...
//      sprintf( fname, "rawvideo/v%d.raw", f++ );
        sprintf( fname, "frames/frame%d.bin", f++ );
        printf( "%s\n", fname );

        FILE *fd = fopen( fname, "rb" );
        if ( !fd )
            sz = 0;
        else
        {
            sz = fread( buf, 1, sizeof( buf ) - FF_INPUT_BUFFER_PADDING_SIZE, fd );
            if ( 0 < sz )
            {
                memset( &buf[ sz ], 0, FF_INPUT_BUFFER_PADDING_SIZE );          

                if ( !fc )
                    create( buf, sz );

                if ( fc )
                    write_frame( buf, sz );

            } // end if

            fclose( fd );

        } // end else

    } while ( 0 < sz );

    destroy();
}

You can create a process to call ffmpeg from the console.

Example of a command line for processing files like 000001.jpg, 000002.jpg, 000003.jpg, ...

ffmpeg -i c:\frames\%06d.jpg -r 16 -vcodec mpeg4 -an -y c:\video\some_video.avi

Other examples are in the ffmpeg docs.
