Twitter metadata#

Hint

Consult Metadata evaluation for an explanation of the evaluation system included in the column Int. Data.

1CATLISM, 197-203

Data points from snscrape1CATLISM, 197-203#

Table 5.12#

Attribute name

Type

Int. Data

Description

_type

string

CID

Internal value added by snscrape, whose value describes the snscrape module used to collect the data (in the case of the twitter-search module, the value is snscrape.modules.twitter.Tweet)

cashtags.*

array

[SID]

An array containing the string version of the cashtags included in the tweet, without the $ character

content

string

SID

Content of the tweet with URLs transformed through the Twitter link shortener t.co (see attribute renderedContent)

conversationId

number

PID

The unique ID of the conversation

coordinates.*

object

[SID]

JSON object containing the geographical coordinates in Decimal Degrees (DD) format

coordinates._type

string

CID

Internal value added by snscrape, whose value describes the snscrape module used to collect the data

coordinates.latitude

number

PID

The latitude of the location from where the tweet was sent

coordinates.longitude

number

PID

The longitude of the location from where the tweet was sent

date

string

SID

Date on which the tweet was posted, using the format YYYY-MM-DDTHH:MM:SS+TZ

hashtags.*

array

[SID]

List of strings containing the hashtags included in the tweet’s contents as strings stripped of the # character; one item for each hashtag

id

number

PID

Tweet unique ID

inReplyToTweetId

number

PID + SID

Unique ID of the tweet the current tweet replies to

inReplyToUser.*

array

[SID]

Array of JSON objects, each one containing details about the creator of the tweet the current tweet replies to, formatted according to the same structure as user.*

lang

string

PID

The language in which the content of the tweet is written, as identified by Twitter using 2-character ISO 639-1 format

likeCount

number

SID

The number of likes (visually rendered with the image of a heart on Twitter) the tweet has at the time of scraping

media.*

array

[SID]

Array of JSON objects, each one containing details regarding the attached media file

media._type

string

CID

Internal value added by snscrape, whose value describes the snscrape module used to collect the data

media.fullUrl

string

PID

For pictures only. Direct link to the full-sized image – appearing when a user clicks on the tweet’s image

media.previewUrl

string

PID

For pictures only. Direct link to the resized image – appearing when viewing the tweet from the web or app interface

media.thumbnailUrl

string

PID

For videos only. Direct link to the video thumbnail

media.variants.*

array

[PID]

For videos only. Array of JSON objects, each one containing details regarding the different versions Twitter creates for the attached video.

media.variants.0._type

string

CID

For videos only. Internal value added by snscrape, whose value describes the snscrape module used to collect the data

media.variants.0.bitrate

number

PID

For videos only. The bitrate of the video

media.variants.0.contentType

string

PID

For videos only. The textual description of the format of the file, e.g. “video/mp4”

media.variants.0.url

string

PID

For videos only. Direct link to the video

mentionedUsers.*

array

[SID]

Array of JSON objects composed of items containing details about users included in the tweet’s contents, formatted according to the same structure as user.*

outlinks.*

array

[SID]

A list of strings, each one representing one of the links included in the tweet’s contents

place.*

object

[SID]

JSON object containing the details of the place from where the tweet was posted, as identified by Twitter

place._type

string

CID

Internal value added by snscrape, whose value describes the snscrape module used to collect the data

place.fullName

string

PID

Full extended name of the location as identified by Twitter

place.name

string

PID

Short name of the location as identified by Twitter

place.type

string

PID

Type of location: e.g. “Country”, “City”, etc…

place.country

string

PID

The name of the country where the place is located, as identified by Twitter

place.countryCode

string

PID

Two-letter country code of the country, in ISO 3166-2 format

quoteCount

number

SID

The number of times the tweet has been quoted by other tweets at the time of scraping

quotedTweet.*

array

[SID]

JSON object composed of items containing details about the quoted tweet and its creator, formatted according to the same structure as the JSON object for the original tweet

renderedContent

string

PID

Content (from attribute content) of the tweet as it appears on the web or app interface and formatted without the use of t.co URL shortener

replyCount

number

SID

Numbers of replies the tweet has received at the time of scraping

retweetCount

number

SID

The number of times the tweet has been retweeted at the time of scraping

retweetedTweet.*

array

[SID]

Array of JSON objects containing details about the retweeted tweet and its creator, formatted according to the same structure as the JSON object for the original tweet

source

string

PID + SID

Link in HTML syntax of the application used by the user to post the tweet; for the official Twitter mobile interface this appears as <a href=”https://mobile.twitter.com” rel=”nofollow”>Twitter Web App

sourceLabel

string

PID

Plain text name of the application used by the user to post the tweet

sourceUrl

string

PID

Direct link to the used-application website

tcooutlinks.*

array

[PID]

A list of strings, each one representing one of the links included in the tweet’s contents shortened using the t.co service

url

string

PID

Direct URL to the tweet

user.*

object

[SID]

JSON object containing details about the account who posted the tweet

user._type

string

CID

Internal value added by snscrape, whose value described the snscrape module used to collect the data

user.created

string

SID

Date on which the account was created, using the format YYYY-MM-DDTHH:MM:SS+TZ

user.description

string

SID

Description associated with the account, as written by the user and as appearing in the web or app interface

user.descriptionUrls.*

array

[SID]

JSON object containing details about the URLS included in the account description – one JSON item per URL

user.descriptionUrls.indices.*

array

[PID]

List containing the positional number of the first and last character of the URL

user.descriptionUrls.tcourl

string

PID

Shortened URL using the t.co service

user.descriptionUrls.text

string

SID

Plain text version of the URL, as written by the user

user.descriptionUrls.url

string

PID + SID

Full URL version, including http(s):// - if missing from the text version

user.displayname

string

SID

The full name associated with the account, as written by the user

user.favouritesCount

number

SID

The number of tweets the account has liked at the time of scraping

user.followersCount

number

SID

The number of followers the account has at the time of scraping

user.friendsCount

number

SID

The number of users the account is following – i.e. friends – at the time of scraping

user.id

string

PID

The account unique ID

user.label.*

array

[PID]

Array of JSON objects, each one containing details regarding the government and state-affiliated media account labels on Twitter

user.label._type

string

CID

Internal value added by snscrape, whose value describes the snscrape module used to collect the data

user.label.badgeUrl

string

PID

Direct link to the image appearing, in the Twitter interface, next to the description of the label

user.label.description

string

PID

Description appearing on the Twitter interface describing the details of the media account according to government and state-affiliated labels employed by Twitter

user.label.longDescription

string

PID

Longer description (if available) of the media account according to government and state-affiliated labels employed by Twitter

user.label.url

string

PID

URL to the documentation ‘About government and state-affiliated media account labels on Twitter’

user.linkTcourl

string

PID

Shortened version of the URL the user has indicated in the ‘Website’ field of the account, using t.co

user.linkUrl

string

SID

Plain text version of the URL the user has indicated in the ‘Website’ field of the account

user.listedCount

number

SID

The number of public lists the account is a member of, at the time of scraping

user.location

string

SID

User-defined location of the account’s profile

user.mediaCount

number

SID

The number of multimedia contents the account has uploaded on Twitter at the time of scraping

user.profileBannerUrl

string

PID

Direct link to the account’s banner picture

user.profileImageUrl

string

PID

Direct link to the account’s profile picture

user.protected

bool

SID

Indicates whether the account is public – i.e. anyone can read the user’s tweets, value equal to “false” – or private – only accounts that have been accepted by the user can read their tweets, value equal to “true”

user.rawDescription

string

PID

Description associated with the account, as written by the user with URLs transformed through the Twitter link shortener t.co

user.statusesCount

number

SID

The number of tweets the account has posted at the time of scraping

user.url

string

PID

Full link to the account’s page, using the username as written by the user

user.username

string

SID

Username of the account through which the tweet was posted

user.verified

bool

SID

Indicates whether the account is verified by Twitter or not – i.e. if a blue stamp is shown next to the displayname value