2024-06-21T12:10:52 n10613

100 days of COVID-19 in the Australian Twittersphere

Viewed: 2785

The Australian Twittersphere is a database of tweets from identified Australian accounts, originally set up through the TrISMA project, and now managed by the QUT Digital Observatory. This dataset includes 3.7 million Australian Twitter accounts, with 1.8 billion tweets captured to date. Since the beginning of 2019, there have been about 800,000 new tweets per day, from 100,000 daily active users. The 100 days of COVID-19 in the Australian Twittersphere dataset consists of 2.8 million tweet IDs corresponding to tweets from the Australian Twittersphere that mention the COVID-19 pandemic, either through Coronavirus specific hashtags or keywords. The tweets were created on or after 20 January 2020, and up until 23 May 2020 (the 15 weeks that form the first ‘100 days’ of COVID-19 in Australia). This dataset provides a glimpse of the experiences and attitudes of Australians presently living through this global pandemic. We are all in this together and as such this dataset has been released as rapidly as possible to enable use by the broader research community.

The SQL used to extract the tweets from the Australian Twittersphere database is as follows:

SELECT tweet_id
FROM oz_twitter.tweet
WHERE created_at >= '2020-01-19 14:00:00' -- corresponds to >= '2020-01-20 00:00:00' in Brisbane time
AND created_at < '2020-05-03 14:00:00' -- corresponds to < '2020-05-04 00:00:00' in Brisbane time
AND multiMatchAny(lower(text), ['covid', 'corona', 'flattenthecurve', 'socialdistancing', 'stayhome', 'lockdown', 'wuhan', 'pandemic'])
AND notEmpty(hashtags) = 1;

Access to the Australian Twittersphere database is managed by the QUT Digital Observatory.

Access rights

This dataset consists of three .xlsx files of Tweet IDs. Use of this dataset to rehydrate tweets is solely for non-commercial research purposes and subject to Twitter's terms, including: Twitter Terms of Service, Privacy Policy, Developer Agreement and Policy.
It is also a condition of use of the dataset that you provide attribution of the dataset to the Digital Observatory.

Geographical area of data collection

text
No direct geographical area is associated with the data. However, the tweets are posted by Twitter accounts that were identified as 'Australian' in 2016.

Research areas

Digital media
Australian Twittersphere
Tweet
Australia
COVID-19
Coronavirus
Twitter
Pandemic
Social media

Cite this collection

Digital Observatory (2020): 100 days of COVID-19 in the Australian Twittersphere. Queensland University of Technology. (Dataset) https://doi.org/10.25912/5f337d9970d8d

Related information

100 days of COVID-19 in the Australian Twittersphere https://100daysofcovid.do-apps.qut.edu.au/

Data file types

Data files are in .csv format.

Licence

Access rights to the dataset are detailed in the Access Rights section.

The SQL used to extract the tweets from the Australian Twittersphere database is distributed under a CC-BY 4.0 International Licence.

Copyright

Access rights to the dataset are detailed in the Access Rights section.

The SQL used to extract the tweets from the Australian Twittersphere database is distributed under a CC-BY 4.0 International Licence.

Dates of data collection

From 2020-01-20 to 2020-05-03

Connections

Has association with
Australian Twittersphere  (Collection)
Has chief investigator
Is output of

Contacts

Other

Date record created:
2020-06-19T10:19:22
Date record modified:
2024-06-21T12:10:52
Record status:
Published - Open Access