Spring 2018 — ASA DataFest

DataFest has concluded successfully.


We had a great time at the ASA DataFest this past weekend and hope you had, too. We would like to thank everyone involved in organizing the event. It wouldn’t have been a success without their support!

Thanks for everyone who participated in our Fall 2017 DataFest and congratulations to our winning teams!

 

The Significant Five – Best Insight

Members: Xiaojing Dong, Ziyi Ye, Wenyi Tao, Tom Chen, Zhongxing Xue

 

The Magnificent Five – Best Visualization

Members: Fan Yang, Ming Li, Jingtian Yao, Jia Zheng, Qianhui(Karlie) Wan

 

FFF – Best Use of External Data

Members: Yuchen Cai, Shih-yin Chen, Wei Qin, Kuo Yang

 

Spotlight – Judges’ Choice Award

Members: Wenshan Wang, Zonghao Li, Xiaoxiao Guo, Xiuqi Shao, Yanan Wang

 

——————————————————————————————————————————————

The American Statistical Association holds DataFest across the United States each spring and we are excited to host their DataFest 2018 at Columbia University!

Date: April 6, 2018 – April 8, 2018

Location: Math 207

 

All team leaders please send an email to [email protected] with a list of your team members (University ID or email address) and team name.

There are three awards for this competition valued at $1100 total.

Grading is based on a cumulative grade with the following criteria:

Model completeness, Presentation/visualization, Business insight, Best use of external data

The final criteria may change at the judges’ discretion.

There is a requirement to use at least one of the datasets provided.

Make sure to present your methodology and results as best as possible on your slides and presentation.

 

 

 Details:

  • All participants will be grouped into teams of 5 or 6.
  • Each team will receive a real-world dataset on Friday (11/03) night. Teams can develop their own topics / questions of interests to work on.
  • There are about one and a half days to complete a project for each team.
  • Each team is also asked to give a presentation on Sunday afternoon (4/8).
  • Any software and programming language is welcomed to be used for project / presentation.
  • Comments will be given by judges. (Judges are introduced below.)
  • During the event, we will also have tutorials from our guest speaker.
  • Many rewards, including gift cards and prizes, will be distributed to the teams who perform well in the presentation.
  • Food and drinks will be provided.

Dataset and Introduction:

The Spring 2018’s dataset comes from ASA Datafest event. 

 

Competition Judges: 

  • Prof. Tian Zheng, Associate Director for Education, Data Science Institute at Columbia University
  • Mr. Lucas Lau, Director, Protiviti
  • Dr. Ke Sang, Data Scientist, Indeed.com

Our Mission for the DataFest:

To give all students who love exploring data a hands-on, real-world experience. It does not matter what software and analysis methods you are using. Coding or programming is not our major purpose for this event, good ideas is indeed the take-home message we are trying to create for you. As long as you are interested in playing with real-world data, please come and join us at April 4th!

Agenda

Friday Night (4/6):

6:00pm – 6:15pm:

Introduction to the DataFest

6:15pm – 6:45pm:

Tutorial by Mr. Ke Shen. (Data Scientist from Jet.com

6:45pm – 7:10pm:

Introduction to the Datasets and requirements

7:10pm – 9:00pm:

Group forming

Saturday (4/7):

9:00am – 4:00pm:

Project preparation

 

 

 

 

 

 

 

Sunday(4/8): 

9:00am – 2:00pm: 

Time for group work

2:30pm – 4:00pm: 

Presentation

4:30pm – 5:00pm:

Evaluation from judges