UM Big Data Summer Camp

University of Michigan

Weiser Hall - 500 Church St., Central Campus
June 17th-21st 2019
9:00 am - 5:00 pm

General Information

Social and organizational life are increasingly conducted or tracked online through electronic media, from emails to Twitter feeds to dating sites to GPS phone tracking. The traces these activities leave behind have acquired the (misleading) title of “big data.” It is a good bet that within a few years, a standard part of graduate training in the social sciences will include a hefty dose of “how to make use of big data,” just as statistical analysis is a standard part of such training today. The ICOS Big Data Camp aims to make big data accessible for people with no prior background. We want people to leave with enough confidence and basic knowledge to know what is possible in their research and where they might go next, drawing on resources at the University of Michigan.

Organizing committee: Elizabeth Bruch, Jerry Davis

Coordinators: Jeffrey W. Lockhart

Instructors: Jeffrey W. Lockhart, TBA

Guides: Jeffrey W. Lockhart, TBA

Speakers: TBA, TBA2, TBA3

Who: The course is aimed at graduate students and other researchers.

Requirements: Participants must bring a laptop with a few specific software packages installed (listed below).

Contact: Please mail jwlock at umich dot edu for more information.


Schedule

Monday 9:00 Introduction with Jerry Davis (Intro ppt, pdf.)
12:00 - 1:00 Lunch
12:15 Lunch Speaker Series
1:00 - 1:30 Lunch Clean Up and Coffee Pick Me Up
3:00 Project Overview and Team Formation
4:00 Team Meetings and Software Check
5:00 Depart
Tuesday 9:00 Q & A and Software Check
12:00 - 1:00 Lunch
12:15 Lunch Speaker Series
1:00 - 1:30 Lunch Clean Up and Coffee Pick Me Up
4:00 Group Work
5:00 Depart
Wednesday 9:00 Start Day
10:45 Break
12:00 - 1:00 Lunch
12:15 Lunch Speaker Series
1:00 - 1:30 Lunch Clean Up and Coffee Pick Me Up
2:30 Break
4:00 Group Work
5:00 Depart
Thursday 9:00 Start Day
10:00 Break
1:00-2:30 Lunch
3:00 HACKATHON BEGINS
5:00 - 10:00 Rooms at Weiser will be available until building closes at 10
Friday 9:00 HACKATHON!
10:00 HACKATHON!
11:00 HACKATHON!
12:00 Lunch - Come get some and keep working
1:00 HACKATHON!
3:00 Presentations Begin
5:00 Feedback
6:00 Celebration at Dominick's

Setup

To participate in the ICOS Big Data Summer Camp, you will need working copies of the software described below. Please make sure to install everything (or at least to download the installers) before the start of camp.

Overview of the tools

Editor

When you're writing code, it's nice to have a text editor that is optimized for writing code, with features like automatic color-coding of keywords.

The Bash Shell

Bash is a commonly-used shell. Using a shell gives you more power to do more tasks more quickly with your computer.

Python

Python is becoming very popular in scientific computing, and it's a great language for teaching general programming concepts due to its easy-to-read syntax. Installing all the scientific packages for Python individually can be a bit difficult, so we recommend an all-in-one installer. One of the best is Anaconda by Continuum Analytics.
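Once Anaconda is installed, one quick way to confirm that the scientific packages are available is to ask Python whether it can find them. The following is a minimal sketch, not part of the official camp materials. The package list (numpy, pandas, matplotlib) is an assumption about what the lessons use, and the helper name check_packages is hypothetical.

```python
import importlib.util
import sys


def check_packages(names):
    """Return a dict mapping each top-level package name to True if Python can find it."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumed package list; adjust it to whatever the lessons actually require.
    wanted = ["numpy", "pandas", "matplotlib"]
    print("Python version:", sys.version.split()[0])
    for name, found in check_packages(wanted).items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

If any package is reported as MISSING, the Anaconda installation is probably not the Python being run, which is worth raising during the software check.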

Jupyter Notebook

JupyterLab is a browser-based interface for interactive computing. Jupyter has support for over 100 programming languages, including those popular in data science such as R, Julia, Scala and, most importantly, Python. Individual notebooks are composable, executable, and sharable documents that mix text, code, data, and visualizations. JupyterLab comes pre-loaded in Anaconda.
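A notebook file is stored as plain JSON, which is part of what makes it easy to share. As a rough sketch (the cell contents are invented for illustration), the following builds a minimal notebook with one text cell and one code cell using only the Python standard library:

```python
import json

# A minimal notebook structure: one Markdown (text) cell and one code cell.
notebook = {
    "cells": [
        {
            "cell_type": "markdown",
            "metadata": {},
            "source": ["# My first notebook\n", "Text and code live side by side."],
        },
        {
            "cell_type": "code",
            "execution_count": None,  # not yet run
            "metadata": {},
            "outputs": [],
            "source": ["print(1 + 1)"],
        },
    ],
    "metadata": {},
    "nbformat": 4,
    "nbformat_minor": 4,
}

# Serialize to the JSON text that a notebook file would contain.
text = json.dumps(notebook, indent=1)

# Read it back and list the kinds of cells it holds.
cell_types = [cell["cell_type"] for cell in json.loads(text)["cells"]]
print(cell_types)  # ['markdown', 'code']
```

Saving the contents of text to a file whose name ends in .ipynb should produce a file that Jupyter can open, showing the text cell above the code cell.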

SQL

SQL is a specialized programming language used with databases. It is a declarative language: you describe (declare) the data you want from the database. We use SQLite Studio for the lessons. There are often memory allocation limits built into web browsers. SQLite Studio is a relatively small application with a very intuitive GUI.
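To illustrate what "declarative" means, here is a small sketch that uses Python's built-in sqlite3 module rather than SQLite Studio. The table name, columns, and rows are made up for illustration. The query states which rows are wanted (people older than 30, sorted by name) and leaves it to the database to work out how to find them.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE people (name TEXT, age INTEGER)")
conn.executemany(
    "INSERT INTO people (name, age) VALUES (?, ?)",
    [("Ana", 34), ("Ben", 27), ("Chloe", 41)],
)

# Declarative: describe the result wanted, not the steps to compute it.
rows = conn.execute(
    "SELECT name, age FROM people WHERE age > 30 ORDER BY name"
).fetchall()
conn.close()

print(rows)  # [('Ana', 34), ('Chloe', 41)]
```

The same SELECT statement could be typed into SQLite Studio's query editor against a table with these columns.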

Windows Installation

Python

  • Download and install Anaconda. Specifically, download the graphical installer for the 3.6 version.
  • Use all of the defaults for installation, except make sure to check “Make Anaconda the default Python.”

Editor (OPTIONAL)

Notepad++ is a popular free code editor for Windows. (NOTE: Be aware that you must add its installation directory to your system path in order to launch it from the command line or have other tools like Git launch it for you. Please ask a TA to help you with this if you are interested.)

Bash Shell on Windows

  • Windows 10 - Please go to the following website which has great instructions for installing the Bash Shell for Windows.
  • Windows 8 and earlier - Please go to the following website and follow the instructions for installing Cygwin, a Unix-like environment that will give you access to Bash. The installation is slightly more involved. If you find yourself having trouble installing it, we will take care of it on the first day of camp.

SQLite Studio

Download SQLite Studio to your desktop and follow the directions for installation.

Mac OS X Installation

Python

  • Download and install Anaconda. Specifically, download the graphical installer for the 3.6 version.
  • Use all of the defaults for installation, except make sure to check “Make Anaconda the default Python.”

Editor (OPTIONAL)

We recommend Sublime Text. In a pinch, you can use nano or vi, which should be pre-installed.

SQLite Studio

Download SQLite Studio to your desktop and follow the directions for installation.