Youtube metadata for videos#
Hint
Consult Metadata evaluation for an explanation of the evaluation system included in the column Int. Data.
CATLISM, 255-262
Data points from youtube-dl
for videos1CATLISM, 255-262
#
Table 5.28 #
Descriptions are adapted from youtube-dl
source code.
Attribute name |
Type |
Int. Data |
Description |
---|---|---|---|
|
string |
PID |
Full name of the video file |
|
number |
PID |
Average audio bitrate in KBit/s |
|
string |
PID |
Name of the audio codec in use |
|
number |
SID + PID |
Age restriction for the video, as an integer (years) (optional) |
|
string |
SID |
A secondary title of the video. (optional) |
|
number |
SID |
Average rating give by users, the scale used depends on the webpage (optional) |
|
array |
[SID + PID] |
An array of strings, each one describing a category that the video falls in, for example [“Sports”, “Berlin”] as assigned by the Youtube system and the content’s creator (optional) |
|
string |
SID |
Full name of the channel the video is uploaded on. Note that channel fields may or may not repeat uploader fields. This depends on a particular extractor. (optional) |
|
string |
PID |
Id of the channel. (optional) |
|
string |
PID |
Full URL to a channel webpage. (optional) |
|
array |
[SID] |
An array of dictionaries containing the start and end time of each video chapter, as defined by the content creator |
|
number |
SID |
The end time of the chapter in seconds (optional) |
|
number |
SID |
The start time of the chapter in seconds (optional) |
|
string |
SID |
The title of the chapter |
|
number |
SID |
Number of comments on the video (optional) |
|
string |
SID |
The creator of the video. (optional) |
|
string |
SID |
Full video description. (optional) |
|
number |
SID |
Number of negative ratings of the video |
|
string |
PID |
An alternative identifier for the video, not necessarily unique, but available before title. Typically, id is something like “4234987”, title “Dancing naked mole rats”, and display_id “dancing-naked-mole-rats” (optional) |
|
number |
SID + PID |
Length of the video in seconds. (optional) |
|
number |
SID |
Time in seconds where the reproduction should end, as specified in the URL. (optional) |
|
string |
PID |
Video filename extension. |
|
string |
CID |
Label of the tool used to extract the data |
|
string |
CID |
Unique ID of the operation conducted by the tool used to extract the data |
|
string |
PID |
Textual description of the format of the content, e.g. ‘160 - 256x144 (144p)’ indicating the internal Youtube number assigned to the format, the size of the content (width and height in pixels) and the resolution of the format |
|
array |
[PID] |
A list of dictionaries for each format available, ordered from worst to best quality. |
|
number |
PID |
Average audio bitrate in KBit/s |
|
string |
PID |
Name of the audio codec in use |
|
number |
PID |
Audio sampling rate in Hertz |
|
string |
PID |
The name of the file container (e.g. mp4) |
|
number |
PID |
The number of parts in which the file is split when transmitted from the Youtube server to the local client |
|
string |
PID |
The extension of the format |
|
number |
PID |
The number of bytes, if known in advance |
|
number |
PID |
An estimate for the number of bytes |
|
string |
PID |
A human-readable description of the format (“mp4 container with h264/opus”). Calculated from the format_id, width, height. and format_note fields if missing. |
|
string |
PID |
A short description of the format (“mp4_h264_opus” or “19”). Technically optional, but strongly recommended. |
|
string |
PID |
Additional info about the format (“3D” or “DASH video”) |
|
number |
PID |
Frame rate of the video |
|
string |
PID |
Base URL for fragments. Each fragment’s path value (if present) will be relative to this URL. |
|
string |
PID |
A list of fragments of a fragmented media. Each fragment entry must contain either an url or a path. If an url is present it should be considered by a client. Otherwise both path and fragment_base_url must be present. |
|
number |
PID |
Height of the video in pixels |
|
array |
[CID] |
An array containing objects with additional HTTP headers (i.e. instructions) that were added to the request made for collecting the data |
|
string |
CID |
Description of the formats requested to the server |
|
string |
CID |
List of the character encodings requested to the server |
|
string |
CID |
List of the compression formats requested to the server |
|
string |
CID |
List of the languages (in two-letter codes, e.g. ‘en’) requested to the server |
|
string |
CID |
The User-Agent (see ‘Crawling and scraping the data’) employed |
|
string |
SID |
Language code, e.g. “de” or “en-US” of the content, as defined by the creator |
|
number |
SID |
The preferred language of the content to be shown to viewers, as set by the creator |
|
string |
PID |
The URL of the manifest file in case of fragmented media (DASH, hls, hds) |
|
bool |
PID |
Whether the server supports download resuming |
|
string |
PID |
Link to the player URL, i.e. the web tool used to play the video |
|
number |
PID |
Order number of this format. If this field is present and not None, the formats get sorted by this field, regardless of all other values. -1 for default (order by other properties), -2 or smaller for less than default. < -1000 to hide the format (if there is another one which is strictly better) |
|
string |
CID |
The protocol used for the actual download, lower-case. “http”, “https”, “rtsp”, “rtmp”, “rtmpe”, “m3u8”, “m3u8_native” or “http_dash_segments”. |
|
number |
PID |
Order number of the video quality of this format, irrespective of the file format. -1 for default (order by other properties), -2 or smaller for less than default. |
|
string |
PID |
Textual description of width and height in pixels |
|
number |
CID |
Order number for the selected video source (quality takes higher priority) -1 for default (order by other properties), -2 or smaller for less than default. |
|
number |
PID |
If given and not 1, indicates that the video’s pixels are not square. |
|
number |
PID |
Average bitrate of audio and video in KBit/s |
|
string |
PID |
The full URL of the video file |
|
number |
PID |
Average video bitrate in KBit/s |
|
string |
PID |
The codec used to encode the video |
|
number |
PID |
Width of the video in pixels |
|
number |
PID |
The number of Frames Per Second of the video |
|
string |
SID |
The full title of the content, as written by the creator |
|
number |
PID |
Height of the video in pixels |
|
string |
PID |
Unique video identifier; this is the code that appears in a Youtube URL |
|
bool |
SID |
Whether this video is a live stream that goes on instead of a fixed-length video. (optional) |
|
string |
SID + PID |
Licence name the video is licenced under. (optional) |
|
number |
SID |
Number of positive ratings of the video (optional) |
|
string |
SID |
Physical location where the video was filmed as set by the creator (optional) |
|
string |
SID |
Name of the playlist the video is part of |
|
string |
PID |
Unique ID of the playlist |
|
number |
SID + PID |
Order number of this video in the playlist it belongs to |
|
string |
SID |
Title of the playlist |
|
string |
SID |
Name of account that uploaded the playlist |
|
string |
PID |
Unique ID of the account that uploaded the playlist |
|
string |
SID |
The date (YYYYMMDD) when the video was released. (optional) |
|
number |
SID |
Number of reposts of the video (optional) |
|
string |
PID |
The video resolution, e.g. 144p |
|
number |
SID |
Time in seconds where the reproduction should start, as specified in the URL. (optional) |
|
bool |
PID |
Whether the video proportions should be kept intact or the video should be resized |
|
array |
[SID] |
The available subtitles as a dictionary in the format {tag: subformats}. “tag” is usually a language code, and “subformats” is a list sorted from lower to higher preference |
|
string |
SID + PID |
The subtitles file contents (optional), where [LL] is a two-letter label identifying the language using ISO 639-1 format |
|
string |
PID |
The extension of the subtitle track format (e.g. SRV3), where [LL] is a two-letter label identifying the language using ISO 639-1 format |
|
string |
PID |
A URL pointing to the subtitles file (optional), where [LL] is a two-letter label identifying the language using ISO 639-1 format |
|
array |
[SID] |
A list of strings each one representing a tag assigned to the video, e.g. [“sweden”, “pop music”] by the creator (optional) |
|
array |
[PID] |
Full URL to a video thumbnail image. |
|
array |
[PID] |
An array of JSON objects containing details for the preview thumbnails |
|
number |
PID |
The size of the thumbnail file in KB |
|
number |
PID |
Height of the thumbnail in pixels |
|
string |
PID |
Thumbnail format internal ID |
|
number |
PID |
Quality of the image using internal descriptions |
|
string |
PID |
Resolution of the video in the format “{width}x{height”}, deprecated) |
|
string |
PID |
Direct link to the thumbnail image |
|
number |
PID |
Width of the thumbnail in pixels |
|
number |
PID |
UNIX timestamp of the moment the video became available. (optional) |
|
string |
SID |
Video title as written by the creator |
|
string |
SID + PID |
Video upload date (YYYYMMDD). If not explicitly set, calculated from timestamp. (optional) |
|
string |
SID |
Full name of the video uploader. (optional) |
|
string |
PID |
Unique ID of the video uploader. (optional) |
|
string |
SID |
Full URL to a personal webpage of the video uploader. (optional) |
|
number |
PID |
Average video bitrate in KBit/s |
|
string |
PID |
Name of the video codec in use |
|
number |
AID |
How many users have watched the video on the platform. (optional) |
|
string |
SID |
The URL to the video webpage, if given to youtube-dl it should allow to get the same result again. (It will be set by YoutubeDL if it’s missing) (e.g. |
|
string |
SID |
The name of the web page contained in the webpage_url data-point, e.g. |
|
number |
PID |
Width of the video in pixels |