Twitter metadata#
Hint
Consult Metadata evaluation for an explanation of the evaluation system included in the column Int. Data.
CATLISM, 197-203
Data points from snscrape
1CATLISM, 197-203
#
Table 5.12#
Attribute name |
Type |
Int. Data |
Description |
---|---|---|---|
|
string |
CID |
Internal value added by snscrape, whose value describes the snscrape module used to collect the data (in the case of the twitter-search module, the value is |
|
array |
[SID] |
An array containing the string version of the cashtags included in the tweet, without the $ character |
|
string |
SID |
Content of the tweet with URLs transformed through the Twitter link shortener t.co (see attribute renderedContent) |
|
number |
PID |
The unique ID of the conversation |
|
object |
[SID] |
JSON object containing the geographical coordinates in Decimal Degrees (DD) format |
|
string |
CID |
Internal value added by snscrape, whose value describes the snscrape module used to collect the data |
|
number |
PID |
The latitude of the location from where the tweet was sent |
|
number |
PID |
The longitude of the location from where the tweet was sent |
|
string |
SID |
Date on which the tweet was posted, using the format YYYY-MM-DDTHH:MM:SS+TZ |
|
array |
[SID] |
List of strings containing the hashtags included in the tweet’s contents as strings stripped of the # character; one item for each hashtag |
|
number |
PID |
Tweet unique ID |
|
number |
PID + SID |
Unique ID of the tweet the current tweet replies to |
|
array |
[SID] |
Array of JSON objects, each one containing details about the creator of the tweet the current tweet replies to, formatted according to the same structure as user.* |
|
string |
PID |
The language in which the content of the tweet is written, as identified by Twitter using 2-character ISO 639-1 format |
|
number |
SID |
The number of likes (visually rendered with the image of a heart on Twitter) the tweet has at the time of scraping |
|
array |
[SID] |
Array of JSON objects, each one containing details regarding the attached media file |
|
string |
CID |
Internal value added by snscrape, whose value describes the snscrape module used to collect the data |
|
string |
PID |
For pictures only. Direct link to the full-sized image – appearing when a user clicks on the tweet’s image |
|
string |
PID |
For pictures only. Direct link to the resized image – appearing when viewing the tweet from the web or app interface |
|
string |
PID |
For videos only. Direct link to the video thumbnail |
|
array |
[PID] |
For videos only. Array of JSON objects, each one containing details regarding the different versions Twitter creates for the attached video. |
|
string |
CID |
For videos only. Internal value added by snscrape, whose value describes the snscrape module used to collect the data |
|
number |
PID |
For videos only. The bitrate of the video |
|
string |
PID |
For videos only. The textual description of the format of the file, e.g. “video/mp4” |
|
string |
PID |
For videos only. Direct link to the video |
|
array |
[SID] |
Array of JSON objects composed of items containing details about users included in the tweet’s contents, formatted according to the same structure as user.* |
|
array |
[SID] |
A list of strings, each one representing one of the links included in the tweet’s contents |
|
object |
[SID] |
JSON object containing the details of the place from where the tweet was posted, as identified by Twitter |
|
string |
CID |
Internal value added by snscrape, whose value describes the snscrape module used to collect the data |
|
string |
PID |
Full extended name of the location as identified by Twitter |
|
string |
PID |
Short name of the location as identified by Twitter |
|
string |
PID |
Type of location: e.g. “Country”, “City”, etc… |
|
string |
PID |
The name of the country where the place is located, as identified by Twitter |
|
string |
PID |
Two-letter country code of the country, in ISO 3166-2 format |
|
number |
SID |
The number of times the tweet has been quoted by other tweets at the time of scraping |
|
array |
[SID] |
JSON object composed of items containing details about the quoted tweet and its creator, formatted according to the same structure as the JSON object for the original tweet |
|
string |
PID |
Content (from attribute content) of the tweet as it appears on the web or app interface and formatted without the use of t.co URL shortener |
|
number |
SID |
Numbers of replies the tweet has received at the time of scraping |
|
number |
SID |
The number of times the tweet has been retweeted at the time of scraping |
|
array |
[SID] |
Array of JSON objects containing details about the retweeted tweet and its creator, formatted according to the same structure as the JSON object for the original tweet |
|
string |
PID + SID |
Link in HTML syntax of the application used by the user to post the tweet; for the official Twitter mobile interface this appears as <a href=”https://mobile.twitter.com” rel=”nofollow”>Twitter Web App |
|
string |
PID |
Plain text name of the application used by the user to post the tweet |
|
string |
PID |
Direct link to the used-application website |
|
array |
[PID] |
A list of strings, each one representing one of the links included in the tweet’s contents shortened using the t.co service |
|
string |
PID |
Direct URL to the tweet |
|
object |
[SID] |
JSON object containing details about the account who posted the tweet |
|
string |
CID |
Internal value added by snscrape, whose value described the snscrape module used to collect the data |
|
string |
SID |
Date on which the account was created, using the format YYYY-MM-DDTHH:MM:SS+TZ |
|
string |
SID |
Description associated with the account, as written by the user and as appearing in the web or app interface |
|
array |
[SID] |
JSON object containing details about the URLS included in the account description – one JSON item per URL |
|
array |
[PID] |
List containing the positional number of the first and last character of the URL |
|
string |
PID |
Shortened URL using the t.co service |
|
string |
SID |
Plain text version of the URL, as written by the user |
|
string |
PID + SID |
Full URL version, including http(s):// - if missing from the text version |
|
string |
SID |
The full name associated with the account, as written by the user |
|
number |
SID |
The number of tweets the account has liked at the time of scraping |
|
number |
SID |
The number of followers the account has at the time of scraping |
|
number |
SID |
The number of users the account is following – i.e. friends – at the time of scraping |
|
string |
PID |
The account unique ID |
|
array |
[PID] |
Array of JSON objects, each one containing details regarding the government and state-affiliated media account labels on Twitter |
|
string |
CID |
Internal value added by snscrape, whose value describes the snscrape module used to collect the data |
|
string |
PID |
Direct link to the image appearing, in the Twitter interface, next to the description of the label |
|
string |
PID |
Description appearing on the Twitter interface describing the details of the media account according to government and state-affiliated labels employed by Twitter |
|
string |
PID |
Longer description (if available) of the media account according to government and state-affiliated labels employed by Twitter |
|
string |
PID |
URL to the documentation ‘About government and state-affiliated media account labels on Twitter’ |
|
string |
PID |
Shortened version of the URL the user has indicated in the ‘Website’ field of the account, using t.co |
|
string |
SID |
Plain text version of the URL the user has indicated in the ‘Website’ field of the account |
|
number |
SID |
The number of public lists the account is a member of, at the time of scraping |
|
string |
SID |
User-defined location of the account’s profile |
|
number |
SID |
The number of multimedia contents the account has uploaded on Twitter at the time of scraping |
|
string |
PID |
Direct link to the account’s banner picture |
|
string |
PID |
Direct link to the account’s profile picture |
|
bool |
SID |
Indicates whether the account is public – i.e. anyone can read the user’s tweets, value equal to “false” – or private – only accounts that have been accepted by the user can read their tweets, value equal to “true” |
|
string |
PID |
Description associated with the account, as written by the user with URLs transformed through the Twitter link shortener t.co |
|
number |
SID |
The number of tweets the account has posted at the time of scraping |
|
string |
PID |
Full link to the account’s page, using the username as written by the user |
|
string |
SID |
Username of the account through which the tweet was posted |
|
bool |
SID |
Indicates whether the account is verified by Twitter or not – i.e. if a blue stamp is shown next to the displayname value |