In the previous post, we successfully uploaded a video to an S3 bucket using multipart upload and presigned URLs. The next thing we need to do is transcode the uploaded video to make it optimal for streaming.
This is the process we'll implement in this post:
Let's start building the video conversion process.
First, create an S3 bucket to store the converted videos.
Before we create the Lambda function, we need some IAM roles that grant the Lambda function permission to work with other AWS services.
MediaConvert needs permission to read and write files in the S3 buckets and to generate CloudWatch events. Therefore, let's create a role for MediaConvert, which the Lambda function will pass to it when creating a job.
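For reference, the trust relationship of that role must allow MediaConvert to assume it. A minimal sketch of the trust policy looks like this (the S3 read/write permissions mentioned above are attached to the role as separate policies of your choice):

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Principal": { "Service": "mediaconvert.amazonaws.com" },
      "Action": "sts:AssumeRole"
    }
  ]
}
```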
After you create a function, you need to update the Lambda execution role so that the function can use other AWS services. With the MediaConvert role we've just created, we'll now configure the role for the Lambda function itself.
The execution role comes with the AWSLambdaBasicExecutionRole policy as a default. Click the Add permissions button and choose Create inline policy.

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Action": [
        "logs:CreateLogGroup",
        "logs:CreateLogStream",
        "logs:PutLogEvents"
      ],
      "Resource": "*",
      "Effect": "Allow",
      "Sid": "Logging"
    },
    {
      "Action": ["iam:PassRole"],
      "Resource": ["MEDIACONVERT_ROLE_ARN"],
      "Effect": "Allow",
      "Sid": "PassRole"
    },
    {
      "Action": ["mediaconvert:*"],
      "Resource": ["*"],
      "Effect": "Allow",
      "Sid": "MediaConvertService"
    },
    {
      "Action": ["s3:*"],
      "Resource": ["*"],
      "Effect": "Allow",
      "Sid": "S3Service"
    }
  ]
}
```
Replace MEDIACONVERT_ROLE_ARN with the ARN of the MediaConvert role we've just created. This allows your Lambda function to create a MediaConvert job and pass the role to it.
After creating the function, it's time to write some code. You can also find it on GitHub. Upload it as a zip file, or just copy and paste it directly into the Lambda console.
In the index.js file, which contains the main handler, we extract the object key and bucket name from the event object, then configure the input/output paths and generate the metadata that will be used later when the job completes.
index.js

```js
const path = require('path');
const fs = require('fs');
const { updateJobSettings, createJob } = require('./lib/job');

exports.handler = async (event) => {
  console.log(event);
  const { MEDIA_CONVERT_ROLE, MEDIA_CONVERT_ENDPOINT, DESTINATION_BUCKET } =
    process.env;

  /**
   * Define inputs/outputs, metadata.
   */
  const srcKey = decodeURIComponent(event.Records[0].s3.object.key).replace(
    /\+/g,
    ' '
  );
  const srcBucket = decodeURIComponent(event.Records[0].s3.bucket.name);
  const { dir, name } = path.parse(srcKey);
  const inputPath = `s3://${srcBucket}/${srcKey}`;
  const outputPath = `s3://${DESTINATION_BUCKET}/${dir}/`;
  const jobMetadata = {};
  jobMetadata['id'] = name;
  jobMetadata['key'] = srcKey;

  let job;

  /**
   * Get job settings
   */
  job = JSON.parse(fs.readFileSync('jobs/video.json'));
  const thumbnailJob = JSON.parse(fs.readFileSync('jobs/thumbnail.json'));
  job.Settings.OutputGroups.push(thumbnailJob);

  /**
   * Parse settings file to update source / destination
   */
  job = await updateJobSettings(
    job,
    inputPath,
    outputPath,
    jobMetadata,
    MEDIA_CONVERT_ROLE
  );

  /**
   * Submit Job
   */
  await createJob(job, MEDIA_CONVERT_ENDPOINT);
  return;
};
```
In the lib folder, we have the job.js file, which updates the job settings dynamically and submits the job to the MediaConvert endpoint.
lib/job.js

```js
const AWS = require('aws-sdk');

exports.updateJobSettings = async (
  job,
  inputPath,
  outputPath,
  metadata,
  role
) => {
  try {
    job.Settings.Inputs[0].FileInput = inputPath;
    job.UserMetadata = { ...job.UserMetadata, ...metadata };
    job.Role = role;

    const outputGroups = job.Settings.OutputGroups;
    for (let group of outputGroups) {
      switch (group.OutputGroupSettings.Type) {
        case 'FILE_GROUP_SETTINGS':
          group.OutputGroupSettings.FileGroupSettings.Destination = outputPath;
          break;
        case 'HLS_GROUP_SETTINGS':
          group.OutputGroupSettings.HlsGroupSettings.Destination = outputPath;
          break;
        case 'DASH_ISO_GROUP_SETTINGS':
          group.OutputGroupSettings.DashIsoGroupSettings.Destination = outputPath;
          break;
        case 'MS_SMOOTH_GROUP_SETTINGS':
          group.OutputGroupSettings.MsSmoothGroupSettings.Destination = outputPath;
          break;
        case 'CMAF_GROUP_SETTINGS':
          group.OutputGroupSettings.CmafGroupSettings.Destination = outputPath;
          break;
        default:
          throw Error(
            'OutputGroupSettings.Type is not a valid type. Please check your job settings file.'
          );
      }
    }

    if (!('AccelerationSettings' in job)) {
      job.AccelerationSettings = { Mode: 'PREFERRED' };
    }

    if (job.Queue && job.Queue.split('/').length > 1) {
      job.Queue = job.Queue.split('/')[1];
    }
  } catch (err) {
    console.log(err);
  }
  return job;
};

exports.createJob = async (job, endpoint) => {
  console.log('Creating Job...');
  const mediaConvert = new AWS.MediaConvert({ endpoint });
  try {
    await mediaConvert.createJob(job).promise();
    console.log('Job has been submitted to MediaConvert.');
  } catch (err) {
    console.log(err);
  }
  return;
};
```
You can find the job configuration files here. In the jobs folder, we have two files for different jobs: one for converting the source video and one for generating a thumbnail image. The converted video will be in CMAF format with three different resolutions: 1080p, 720p, and 540p.
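To give an idea of how these files relate to the switch statement in lib/job.js, here is an abbreviated sketch of what the CMAF output group in jobs/video.json might look like. The field values are illustrative rather than the exact settings from the repository, and Destination is left out on purpose because updateJobSettings fills it in at runtime:

```json
{
  "Name": "CMAF",
  "OutputGroupSettings": {
    "Type": "CMAF_GROUP_SETTINGS",
    "CmafGroupSettings": {
      "SegmentLength": 6,
      "FragmentLength": 2,
      "WriteDashManifest": "ENABLED",
      "WriteHlsManifest": "ENABLED"
    }
  },
  "Outputs": []
}
```

Each entry in Outputs then describes one rendition (1080p, 720p, 540p) with its video and audio settings.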
Optionally, you can create your own job template from the MediaConvert console. You can also extract these configuration files, store them in an S3 bucket, and read them with the aws-sdk built into Lambda.
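If you go that route, a minimal sketch of loading the settings from S3 could look like this; JOB_SETTINGS_BUCKET and JOB_SETTINGS_KEY are hypothetical environment variables used only for illustration:

```js
const AWS = require('aws-sdk');
const s3 = new AWS.S3();

// Load the MediaConvert job settings JSON from S3 instead of bundling it
// with the function code.
const loadJobSettings = async () => {
  const { JOB_SETTINGS_BUCKET, JOB_SETTINGS_KEY } = process.env;
  const { Body } = await s3
    .getObject({ Bucket: JOB_SETTINGS_BUCKET, Key: JOB_SETTINGS_KEY })
    .promise();
  return JSON.parse(Body.toString('utf-8'));
};
```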
After updating the code, add environment variables. We'll use environment variables for the converted bucket, the MediaConvert endpoint, and the MediaConvert role we created earlier. You can find your MediaConvert endpoint in the Account menu of the MediaConvert console.
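For reference, these are the variable names read in index.js; the values below are placeholders you should replace with your own:

```
DESTINATION_BUCKET      = your-converted-bucket-name
MEDIA_CONVERT_ENDPOINT  = https://abcd1234.mediaconvert.us-east-1.amazonaws.com
MEDIA_CONVERT_ROLE      = arn:aws:iam::123456789012:role/your-mediaconvert-role
```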
Now we come to the final step of creating the first Lambda function: adding a trigger. We want to run this function whenever an upload to the source bucket is finished, so we add an S3 event notification on that bucket as the trigger.
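Configured through the console, the resulting notification configuration on the source bucket is roughly equivalent to the following; the .mp4 suffix filter is an assumption, and LAMBDA_FUNCTION_ARN is a placeholder for your function's ARN:

```json
{
  "LambdaFunctionConfigurations": [
    {
      "LambdaFunctionArn": "LAMBDA_FUNCTION_ARN",
      "Events": ["s3:ObjectCreated:*"],
      "Filter": {
        "Key": {
          "FilterRules": [{ "Name": "suffix", "Value": ".mp4" }]
        }
      }
    }
  ]
}
```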
So far, we've implemented the conversion process. Whenever a video is uploaded, the Lambda function is triggered to submit a job to MediaConvert. The converted video and generated thumbnail are then stored in the converted bucket.
However, the data we stored in DynamoDB does not reflect the correct URL; currently it points to the mp4 source file. Thanks to this setup, users can still access the video content through the source URL while the convert job is in progress. But after the convert job is finished, we want the URL to point to the converted file.
We can achieve this by creating another Lambda function, but this time the trigger will come from EventBridge.
Go to the Lambda console and create another function. We're simply going to take the same steps as we did before.
This time, we need to access DynamoDB instead of MediaConvert.
Again, the execution role comes with the AWSLambdaBasicExecutionRole policy as a default. Click the Add permissions button and choose Create inline policy.

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "VisualEditor0",
      "Effect": "Allow",
      "Action": "dynamodb:UpdateItem",
      "Resource": "DYNAMODB_TABLE_ARN"
    }
  ]
}
```
Replace DYNAMODB_TABLE_ARN with your DynamoDB table ARN.
This Lambda function will be triggered when the MediaConvert job is completed. To listen for the job-complete event from MediaConvert, we can leverage EventBridge. EventBridge allows us to run automated jobs based on various AWS event patterns as well as schedule expressions.
Go to the EventBridge console and create a new rule, where you configure the event pattern and the target.
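For the event, a pattern like the following matches MediaConvert jobs that reach the COMPLETE state:

```json
{
  "source": ["aws.mediaconvert"],
  "detail-type": ["MediaConvert Job State Change"],
  "detail": {
    "status": ["COMPLETE"]
  }
}
```

For the target, choose the Lambda function we just created.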
After creating the event rule, you can find it automatically attached as a trigger in the Lambda console.
The code of this Lambda function is quite simple. When we submitted the convert job, we saved the id and key fields in the job's user metadata. We can retrieve this metadata from the event object and, with that, update the item so its URL points to the converted file extension.
index.js

```js
const AWS = require('aws-sdk');
const dynamoClient = new AWS.DynamoDB.DocumentClient();

exports.handler = async (event) => {
  console.log(event);
  const { TABLE_NAME } = process.env;
  const { id, key } = event.detail.userMetadata;

  const params = {
    TableName: TABLE_NAME,
    Key: { id },
    UpdateExpression: 'set #url = :url',
    ExpressionAttributeNames: { '#url': 'url' },
    ExpressionAttributeValues: { ':url': key.replace(/.[^.]+$/, '.mpd') },
  };

  await dynamoClient.update(params).promise();
};
```
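For context, the event that EventBridge passes to this handler is a MediaConvert Job State Change event. A heavily trimmed example, with hypothetical id and key values, looks like this:

```json
{
  "source": "aws.mediaconvert",
  "detail-type": "MediaConvert Job State Change",
  "detail": {
    "status": "COMPLETE",
    "userMetadata": {
      "id": "my-video",
      "key": "videos/my-video.mp4"
    }
  }
}
```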
After deploying the code, set the TABLE_NAME environment variable to your DynamoDB table name.
After uploading a new video file from our app, you can find a new job created in your MediaConvert console.
When the job is finished, the converted files and thumbnails are stored in the converted bucket.
The data stored in DynamoDB is also updated successfully.
That's all for converting videos automatically using AWS Lambda and other services. Now we have a more modern way of streaming video with multiple resolutions, rather than just serving the source video.
Also, the whole conversion process is executed through asynchronous invocations (S3 notifications, EventBridge), which means it will not block the workflow of our application.
In the next section, which is the final step, we'll deliver the converted video to the client using a CloudFront distribution.
You can find the source code of this tutorial here.