2024-06-21T12:14:48 n41990

Australian Twittersphere Population Rules

Viewed: 212

The Australian Twittersphere is a longitudinal collection of tweets from a population of Twitter accounts which explicitly self-identify as Australian or having connection to Australia. A publication detailing the method used to determine this population is forthcoming. This dataset contains the rules for inclusion in the Australian Twittersphere which are derived by the population method. These rules were developed by the QUT Digital Observatory team in 2021, and then used to determine the Twitter accounts which were included in the Australian Twittersphere up until the collection ceased in 2023.

To use these rules:

The rules file can be used with the twittersphere tool to filter a list of Twitter accounts down to those which match the rules and as such would have been suitable for inclusion in the Australian Twittersphere.

File specification:

aus_ts_rules_2021-09-16.csv is a CSV file encoded as UTF-8, with commas as field separators and each value enclosed in double quotation marks. There is a header row with column titles at the start, and each subsequent row (10,940 in total) represents a single rule in the ruleset.

Columns:

  • include: 1, -1, or blank
    • 1 indicates that this is a positive matching rule
    • -1 indicates that this is a negative matching rule
    • blank: does not affect results, but this rule is retained to indicate that it has already been assessed
  • discuss: Non-blank values indicate that this rule requires further discussion for finalisation
  • note: Any notes
  • field: Which field of the Twitter account this rule is to be applied to. Options: 'location', 'description', or 'realname'
  • first_token: Token to match on to apply the rule
  • second_token: Token to match on to apply the rule
  • third_token: Token to match on to apply the rule

Note that first, second, and third tokens together represent a sequence of up to three tokens appearing in the specified field. If the relevant sequence for the rule is only one or two tokens in length, third_token and possibly second_token will be empty.

Geographical area of data collection

text
Tweets from a population of accounts which explicitly state connection to Australia.

Research areas

Social media
Twitter

Cite this collection

Hames, Sam; Takahashi, Marissa; Miller, Alice; QUT Digital Observatory; (2023): Australian Twittersphere Population Rules. Queensland University of Technology. (Dataset) https://doi.org/10.25912/RDF_1711599518062

Data file types

aus_ts_rules_2021-09-16.csv is a CSV file encoded as UTF-8, with commas as field separators and each value enclosed in double quotation marks. There is a header row with column titles at the start, and each subsequent row (10,940 in total) represents a single rule in the ruleset.

Licence


Creative Commons Attribution 4.0 (CC-BY)
http://creativecommons.org/licenses/by/4.0/

Copyright

© QUT Digital Observatory, 2024.

Dates of data collection

From 2018 to 2021

Connections

Has chief investigator
Is output of

Contacts

Other

Date record created:
2023-04-03T09:43:04
Date record modified:
2024-06-21T12:14:48
Record status:
Published - Open Access