Not logged in
Talk
Contributions
Create account
Log in
Navigation
Main page
About
People
Publications
Teaching
Resources
Research Blog
Wiki Functions
Recent changes
Help
Licensing
Page
Discussion
Edit
View history
Editing
Twitter words of warning
From CommunityData
Jump to:
navigation
,
search
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
Twitter research can be heaps of fun, but it does have some pitfalls. Here are a few things to keep in mind... '''You are what you sample.''' When we write a python script to collect Twitter data we are only building one part of a scientific collection instrument. Twitter provides the rest. Not knowing how your collection instrument works is a problem for a scientific researcher. For example, we can’t be sure when we run a collection that we got everything that we requested. We don’t know what’s missing. Things might be missing because Twitter has a policy on how they release information. Things might be missing because there is something funny going on in Twitter’s technical systems. Either way, we scientists working outside of Twitter can not interrogate Twitter’s side of the collection directly. '''Lots of people use Twitter. Lots don’t. And we don’t know the difference.''' There are heaps and piles and mounds of research that tells us that so far, no single socio-technical system is used by every person on earth. Rather, every communication has its own set of users. Twitter is a company. Who Twitter users are and how those users might be the same or different to any other group is proprietary information. Be wary of generalizations made between Twitter users and any other group such as populations are large. '''Love the rainbow. Fear the rainbow.''' The fun of doing research on Twitter is that there is such so much heterogeniety. Twitter breaks up the account and Tweet data into over 200 different categories. Many of these categories are themselves hugely diverse. '''Scientific Twitter research = Big work, small claims.''' For the reasons above, expect to do a lot of leg work to get meaningful insights out of Twitter data. Also expect that those insights may be very circumspect. '''Don’t go easy on other Twitter researchers'''. The above advice might seem like Research 101 advice, we see these mistakes over and over in published papers. [[Category:CDSW]]
Summary:
Please note that all contributions to CommunityData are considered to be released under the Attribution-Share Alike 3.0 Unported (see
CommunityData:Copyrights
for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource.
Do not submit copyrighted work without permission!
To protect the wiki against automated edit spam, we kindly ask you to solve the following CAPTCHA:
Cancel
Editing help
(opens in new window)
Tools
What links here
Related changes
Special pages
Page information