MCAP Format Registry

This document describes well-known values for MCAP files.

Compression formats

The Chunk compression field may contain the following options:

lz4: an algorithm that prioritizes compression/decompression speed over compression ratio.
zstd: an algorithm with a tunable compression ratio/speed tradeoff.

Message encodings

The Channel message_encoding field describes the encoding for all messages within a channel. This field is mandatory.

ros1

message_encoding: ros1

cdr

message_encoding: cdr (used in ROS 2)

protobuf

message_encoding: protobuf

flatbuffer

message_encoding: flatbuffer

cbor

message_encoding: cbor

msgpack

message_encoding: msgpack

json

message_encoding: json

Schema encodings

The Schema encoding field describes the encoding of a Channel's schema. Typically, this is related to the Channel's message_encoding, but they are separate concepts (e.g. there are multiple schema languages for json).

This field is required for some message encodings (e.g. protobuf) and optional for others (e.g. json).

(empty string)

Schema encoding may only be omitted for self-describing message encodings such as json.

name: May contain any value
encoding: (empty string)
data: Must be empty (0 bytes)

protobuf

name: Fully qualified name to the message within the descriptor set. For example, in a proto file containing package foo.bar; message Baz {} the fully qualified message name is foo.bar.Baz.
encoding: protobuf
data: A binary FileDescriptorSet as produced by protoc --include_imports --descriptor_set_out.

flatbuffer

name: Fully qualified name corresponding to a name in the reflection Object table. For example in a schema containing namespace foo; table Bar {} the fully qualified name is foo.Bar.
encoding: flatbuffer
data: A reflection.Schema flatbuffer describing the parsed schema encoded as a binary flatbuffer. This can be generated by running flatc -b --schema on the fbs file.

ros1msg

name: A valid package resource name, e.g. sensor_msgs/PointCloud2.
encoding: ros1msg
data: Delimited concatenated ROS1 .msg files.

ros1msg Data Format

the data field contains the concatenated .msg file content that is sent in the ROS subscription connection header for this message type.

The top-level message definition is present first, with no delimiter. All dependent .msg definitions are preceded by a two-line delimiter:

One line containing exactly 80 = characters
One line containing MSG: <package resource name> for that type. The space between MSG: and the package resource name is mandatory. The package resource name does not include a file extension.

This format can be reproduced using gendeps --cat.

ros2msg

name: A valid package resource name, e.g. sensor_msgs/msg/PointCloud2.
encoding: ros2msg
data: Delimited concatenated ROS2 .msg files

ros2msg Data Format

The .msg definition is stored alongside its dependencies in the same format as ros1msg.

ros2idl

name: A valid package resource name, e.g. sensor_msgs/msg/PointCloud2
encoding: ros2idl
data: Delimited concatenated ROS2 .idl files

ros2idl Data Format

The IDL definition of the type specified by name along with all dependent types are stored together. The IDL definitions can be stored in any order. Every definition is preceded by a two-line delimiter:

a line containing exactly 80 = characters, then
A line containing only IDL: <package resource name> for that definition. The space between IDL: and the package resource name is mandatory. The package resource name does not include a file extension.

omgidl

The omgidl schema encoding stores OMG IDL definitions for (X)CDR-encoded messages.

name: The globally-scoped name of the message type, eg. top_level_module::my_module::MyType
encoding: omgidl
data: valid OMG IDL definition file contents with a definition for the message type and all referenced types.

Producing `schema.data` for the `omgidl` schema encoding

Most commonly, IDL definitions for separate types are stored in separate files, and use C++ preprocessor #include statements to include definitions for types referenced in that definition.

For the omgidl schema encoding, the data field needs to contain all of those definitions. In other words, the data field should read as a single source file containing definitions for the message type and all referenced types.

One way to produce this is to use a C++ compiler to resolve those #include directives:

# Run from the root of your IDL definition tree, or use additional `-I` arguments to indicate your
# full include path to the compiler.
# The `gcc` output may contain debugging lines such as "# 1 "top_level_module/my_module/MyType.idl",
# the grep command here is intended to filter those out.
$ gcc -E -I. top_level_module/my_module/MyType.idl | grep -v "^# \d" > all_definitions.idl

For the following files:

top_level_module/my_module/MyType.idl
#include "top_level_module/other_module/OtherType.idl"

module top_level_module {
  module my_module {
    struct MyType {
      top_level_module::other_module::OtherType val;
    }
  }
}

top_level_module/other_module/OtherType.idl
module top_level_module {
  module other_module {
    struct OtherType {
      float val;
    }
  }
}

This should produce:

schema.data
module top_level_module {
  module other_module {
    struct OtherType {
      float val;
    }
  }
}

module top_level_module {
  module my_module {
    struct MyType {
      top_level_module::other_module::OtherType val;
    }
  }
}

For this example, schema.name should be set to top_level_module::my_module::MyType.

jsonschema

name: May contain any value
encoding: jsonschema
data: JSON Schema

Profiles

ROS1

The ros1 profile describes how to create MCAP files for ROS 1.

profile: MUST be ros1

Channel

message_encoding: MUST be ros1
metadata keys:
- callerid (optional, string)
- latching (optional, bool stringified as "true" or "false")

Schema

encoding: MUST be ros1msg

ROS2

The ros2 profile describes how to create MCAP files for ROS 2.

Header

profile: MUST be ros2

Channel

message_encoding: MUST be cdr

metadata:

offered_qos_profiles (required, string) YAML formatted sequence of QoS policy values. See ROS QoS Policies for Recording docs for more details. The policy value may change depending on the ROS distro. Below is an example for a single policy schema value.

history: [keep_all, keep_last]
depth: int
reliability: [system_default, reliable, best_effort, unknown]
durability: [system_default, transient_local, volatile, unknown]
deadline:
  sec: int
  nsec: int
lifespan:
  sec: int
  nsec: int
liveliness: [system_default, automatic, manual_by_topic, unknown]
liveliness_lease_duration:
  sec: int
  nsec: int
avoid_ros_namespace_conventions: [true, false]

Schema

encoding: MUST be either ros2msg or ros2idl

MCAP Format Registry

Compression formats​

Message encodings​

ros1​

cdr​

protobuf​

flatbuffer​

cbor​

msgpack​

json​

Schema encodings​

(empty string)​

protobuf​

flatbuffer​

ros1msg​

ros1msg Data Format​

ros2msg​

ros2msg Data Format​

ros2idl​

ros2idl Data Format​

omgidl​

Producing schema.data for the omgidl schema encoding​

jsonschema​

Profiles​

ROS1​

Header​

Channel​

Schema​

ROS2​

Header​

Channel​

Schema​

Compression formats

Message encodings

ros1

cdr

protobuf

flatbuffer

cbor

msgpack

json

Schema encodings

(empty string)

protobuf

flatbuffer

ros1msg

ros1msg Data Format

ros2msg

ros2msg Data Format

ros2idl

ros2idl Data Format

omgidl

Producing `schema.data` for the `omgidl` schema encoding

jsonschema

Profiles

ROS1

Header

Channel

Schema

ROS2

Header

Channel

Schema