CommunityData:Resources: Difference between revisions

This page collects resources for Community Data Science Collective members.


If you're new to the collective, check out the [https://wiki.communitydata.science/CommunityData:Introduction_to_CDSC_Resources Introduction to CDSC Resources].
== New Member Resources ==
New members of the CDSC should check out both of the links below to become acquainted with the various resources and tools used within the group.


== Technical documentation and getting setup ==
* [[CommunityData:Introduction to CDSC Resources]]
* [[CommunityData:Onboarding Checklist]]


== Non-Technical Resources ==
 
===Administrative===
* [[Schedule]] — Deadlines, events, and similar
* [[CommunityData:Workshop]] — Weekly workshop sessions for sharing work and getting feedback
* [[CommunityData:Jargon]] — Jargon and Common Shorthand
* [[CommunityData:Logos]] — Like our visual branding, not like λόγος. Although we should always make sure we're good in that department too.
 
===Students===
* [[CommunityData:Prospective students]] — Some materials we have provided to prospective students and used in organizing Q&A events.
* [[CommunityData:General examinations motivating questions]] — A set of questions borrowed and adapted from [https://www.hcde.washington.edu/turns Jennifer Turns] that are a useful way to start preparing for general examinations.
 
===Research===
* [[CommunityData:Planning document]] — Details on producing Matsuzaki-style planning documents
* [[CommunityData:Research participant compensation]] — Notes on procedures related to human subjects compensation (e.g. for interview studies)
 
== Communication Infrastructure ==
The CDSC uses three primary modes of internal communication: email lists, IRC channels, and Jitsi video conference calling.
 
===Email lists===
New members of this group will need to subscribe to CDSC email lists to stay up to date on important information such as group announcements and to review shared materials before meetings and workshops. See the link below on how to subscribe to our email lists:
* [[CommunityData:Email]] — Information on email lists, email aliases and their management.
 
===IRC===
New members should also join the CDSC IRC channel for synchronous communication (live chat) between group members. See the link below on how to connect to our IRC channel and get set up with IRC in general:
* [[CommunityData:IRC]] — How to get set up on our chat system, [[:wikipedia:IRC|IRC]]
 
===Jitsi===
Finally, the CDSC primarily uses Jitsi for video conference calls for weekly group meetings. You will want to learn how to use Jitsi and proper etiquette during video call meetings by checking out the link below:
* [[CommunityData:Jitsi]] — Some etiquette/usability tips for Jitsi, our preferred video conference tool.
 
===Blogs and Wiki Information===
For information on how the CDSC blogs, tweets, posts, etc., check out the links below:
* [[CommunityData:Blog and social media]] — Writing/editing blogposts, tweets, and social media
* [[CommunityData:Blog post schedule]] — What's up next?
* [[CommunityData:Wiki]] — this very wiki that you're reading things on RIGHT NOW
 
== Research Infrastructure ==
 
* [[CommunityData:Code]] — List of software projects maintained by the collective.
* [[CommunityData:Git]] — Getting set up on the git server
* [[CommunityData:Otter.ai]] — Audio-to-text transcription software
* [[CommunityData:Taguette]] — Qualitative coding analysis software
* [[CommunityData:Tmux]] — Using tmux (terminal multiplexer) to keep a persistent session on a server. 
* [[CommunityData:Zotero]] — How to use our shared Zotero directory.
* [[CommunityData:Etherpad]] — We use [[:wikipedia:Etherpad|Etherpad]] for collaborative real-time note-taking and such. This page has some information about that as well as details about how to make sure your pad is backed up.
* [[CommunityData:MySQL]] — How to use MySQL databases on Kibo.
* [[CommunityData:Dataverse]] — How to use [https://dataverse.harvard.edu/dataverse/communitydata the Community Data Science Dataverse] within the [https://dataverse.harvard.edu/ Harvard Dataverse]
 
== Papers, Presentations, and Templates ==
 
Resources for getting set up with, and troubleshooting, LaTeX and papers:
* [[CommunityData:Automating and Streamlining Walkthrough]] — A friendly walkthrough of several of the technologies below
* [[CommunityData:TeX]] — Installing our LaTeX templates
* [[CommunityData:TeX#Using the poster template]] — a little bit of detail on how to find/use the poster template
* [[CommunityData:Beamer]] — Installing/using [[Mako]]'s Beamer templates
* [[CommunityData:Knitr]] — Using Knitr with TeX to build graphs and tables, and to insert and format numbers in TeX documents.
* [[CommunityData:Embedding fonts in PDFs]] — <code>ggplot2</code> creates PDFs with fonts that are not embedded which, in turn, causes the ACM to bounce our papers back. This page describes how to fix it.
* [[CommunityData:Build papers]] — Both the TeX and Beamer templates above come along with a Makefile that makes some assumptions about your workflow. Learn about that here.
* [[CommunityData:LaTeX to Word]] — Some journals require submissions in Word format. Here are some options for doing that.
* [[CommunityData:Email]] — Information on email aliases and their management.
* [[CommunityData:LaTex Diff]] — For an R+R, it's often helpful to create a PDF that shows the changes made. Here's one way to do that.
 
* [[CommunityData:Exporting from Python to R]]
A few of us use HTML-based presentations. Information on that is here:
 
* [[CommunityData:Hyak]] — Using the Hyak supercomputer system at UW for research.
* [[CommunityData:reveal.js]] — Using RMarkdown to create reveal.js HTML presentations
* [[CommunityData:Hyak setup]] — Getting an account and getting set up on Hyak.
 
* [[CommunityData:Hyak Spark]] — Documents how to use Spark on Hyak
== Computation, Servers, and HPC ==
 
* [[CommunityData:Compute Overview and Resource Matching]] -- What we have and what it's good for
* [[CommunityData:Hyak]] — Using the Hyak supercomputer system at UW for research (several pages are linked from the top of that page)
* [[CommunityData:Hyak_tutorial]] - Tutorial for new people to learn how to use Hyak.
* [[CommunityData:Kibo]] — Getting started with the Kibo system at NU for research.
* [[CommunityData:MySQL]] — Creating MySQL databases on Kibo
* [[CommunityData:Northwestern VPN]] — Connecting to the Northwestern VPN
* [[CommunityData:Backups (nada)]] — Details on what is, and what isn't, backed up from nada.
* [[CommunityData:When a service is down]]
 
== Research and Data ==
 
* [[CommunityData:ORES]] - Using ORES with Wikipedia data
* [[CommunityData:Wikia data]] — How to get and validate Wikia dumps.


== Project Pages ==
Project Pages:
 
* [[CommunityData:Message Walls]] -- Documents information about how to get and validate wikia dumps.


== Conference Planning ==
== Future Meetings and Conferences ==
* [[CommunityData:CSCW_2019]]
We typically have an annual CDSC retreat; you can see pages for many of them in the Past Meetups section below. They often include workshopping, which historically involved [[CommunityData:Critique and Feedback Session]]s.
 
* 2024 Meetup TBD - As of Dec. 1, 2023, planning for a future retreat (anticipated fall 2024) has not yet started. Come back later!
 
Conferences we often attend include ICA, CHI, and CSCW, among many others. You can find information about the upcoming iteration of each conference on its website.
 
== Past Meetups ==
 
Group meetups (CDSC retreats):


== Ongoing and Future Meetings and Meetups ==
* [[CommunityData:Meetup_September_2023]]
* [[CommunityData:CDSC_Meetup_September_2019]]
* [[CommunityData:Meetup October 2022]]
* [https://wiki.communitydata.science/Fall_2021_Retreat Online Retreat October 2021]
* <strike>[[CommunityData:Meetup April 2020]]</strike> (cancelled due to [[COVID]])
* [[CommunityData:Meetup September 2019]]
* [[CommunityData:Meetup March 2019]]
== Meetings and Meetups of Past ==
* [[CommunityData:Meetup September 2018]]
* [[CommunityData:Meetup April 2018]]
* [[CommunityData:Meetup April 2018: Organizational notes]]
* [[CommunityData:Meetup July 2017]]
* [[CommunityData:UW Weekly Meeting]]


== Non-technical resources ==
Other meetups:
 
* [[CommunityData:Logos]] — Like our visual branding, not like λόγος. Although we should always make sure we're good in that department too.
* [[CommunityData:CSCW 2019]]
* [[CommunityData:Advice on writing a background section to an academic paper]] — Once upon a time, Mako and Aaron submitted a paper with a mediocre introduction to a journal. Mac Parks, the editor of that journal at the time, set us straight with some very clear pointers. Save yourself the trouble and learn to follow these today!
* [[Sociotechnocanonicon|Sociotechnocanonicon Great Books Discussion Series]]


== University of Washington Resources ==
* [[CommunityData:Related seminars at UW]]
* [[CommunityData:IRB training for Scratch Research at UW]]
* [[CommunityData:UW NetID]]
* [[CommunityData:Reimbursements (UW)]] — Information on the (rather complicated!) process for getting reimbursed at UW
* [[Community Data Science Lab (UW)]] — Directions to the lab space at UW. This is something you can share with visitors.


== Northwestern Resources ==


* [[Community Data Science Lab (NU) Pandemic research plan]] — Pandemic research plan for CDSC NU created as part of Northwestern's response to the COVID-19 pandemic.
* [[CommunityData:NU grant reimbursement]]
* [https://wiki.communitydata.science/User:Aaronshaw/AdvisingOH Aaron's OH sign up]
* [[Community Data Science Lab (NU)]] — Directions to the lab space at NU. This is something you can share with visitors.


== Chat on IRC ==
== Purdue Resources ==
 
A number of us are idling in #communitydata on [http://www.oftc.net OFTC] (irc.oftc.net). IRC is basically a chat system that is similar to Slack in many ways. In fact, it was the inspiration for Slack!
 
To use IRC, you'll need a client. A really good one for folks new to IRC is [https://irccloud.com IRCCloud]. IRCCloud has a web interface as well as good apps for iOS and Android.


One limitation of IRCCloud is that, after a 1-week trial period, the system will disconnect folks every two hours. There are a couple of options for dealing with this. The easiest is subscribing to IRCCloud, which costs $5/month or $50 a year. You just pay for a year and send the receipt to [[User:Ashaw]], who will pay the bill! If you are totally new to IRC and just want something easy and straightforward, this is our recommendation.
To be populated soon!
 
The other options are more indirect, technical, and/or involve a bit more work or figuring stuff out:
 
;Bouncer:One option is an IRC "bouncer" such as [https://znc.in ZNC].
;IRC Bridge/Matrix:Another option that a few people use is connecting through a service with an IRC bridge. One popular one is the [https://matrix.org Matrix] protocol. While there are numerous [https://matrix.org/docs/projects/try-matrix-now.html#clients clients], [[User:Salt]] recommends [https://riot.im Riot], which is freely supported on every platform. Once you have Matrix set up, join the room <code>#_oftc_#communitydata:matrix.org</code> to bridge into IRC from Matrix. Get in contact with [[User:Salt]] if you want to go this way.
 
=== Registering your "nick" (i.e., IRC username) ===
 
Due to spam, we have on a couple of occasions in the past had to block "non-registered" users from posting to <code>#communitydata</code>. This helps a lot with the spam but has a big disadvantage in that if you speak on the channel but are not registered, nobody else will be able to hear you!
 
It's a good idea to register your nickname or "nick" in any case because doing so reserves it so that nobody else can have it.
 
If you've already done this long ago, you can stop reading, there's nothing new to do. If you haven't gotten fully registered and verified, here are the four "easy" steps, heavily mediated by the NickServ bot:
 
# From IRC (irc.oftc.net), <code>/msg NickServ register <<your password>> <<your email>></code>
# Verification is via the website, see: [https://services.oftc.net/]
# Ask NickServ '<code>checkverify</code>' and she'll tell you if you're verified.
# Please say something on the channel to test. If you do not get a response, your messages may not be visible to other channel members. <code>/msg mako</code> or <code>ashaw</code> for help.


== Diversions ==  
* [[CommunityData:Light events]] — The light in the lab at UW is funny. We have three fluorescent lights. On flipping the light switch, only two turn on. The third turns on ''eventually''. We are studying this arcane phenomenon.


== Game IDs ==
* [[CommunityData:GameIDs]] — A directory containing the game IDs for CDSC members to connect with each other across various gaming platforms.
* [[CommunityData:Lab art]] - A page with sources of ~physical lab decor~
== Organizing ==
* [[CommunityData:Group Tasks]] - A collection of different tasks that benefit the group.
* [[CommunityData:Roles]] - A page describing the current membership roles and expectations around them.

Latest revision as of 06:43, 5 February 2024
