RIT ASA DataFest 2024

This event will be held March 22-24, 2024, on the RIT campus in the Xerox Auditorium, Room 2580 in Gleason Hall (Building 9 - Engineering Building).

Undergraduate students from various colleges and universities in the Greater Rochester area and neighboring cities will participate in a national festival of data sanctioned by the American Statistical Association.

What is ASA DataFest?

The American Statistical Association (ASA) DataFest is a celebration of data in which teams of undergraduates work around the clock to find and share meaning in a large, rich, and complex data set. Undergraduate students from the Rochester area will participate in this 3-day data analysis competition sponsored by the American Statistical Association. Benefits of taking part in the DataFest competition include:

  • Collaborating with a team to analyze a highly comprehensive and intricate dataset provided by a real organization - potentially the most extensive one you have encountered to date.
  • Connecting with other data science experts and students from diverse colleges and universities.
  • Forging meaningful relationships that will enrich your learning and aid in your professional growth.
  • Acquiring valuable experiences to discuss in future job interviews.
  • Showcasing your ability to overcome obstacles, excel under pressure, and hone your problem-solving skills that are transferable to any workplace.
  • Working alongside students with varying levels and backgrounds in data science education.

Event Schedule - March 22-24, 2024

DataFest will be held in the Xerox Auditorium, which is located in James E. Gleason Hall in the Kate Gleason College of Engineering.

Friday, March 22nd, 2024 
From 5:30 pm to 11:00pm

  • 5:30-6:30pm
    • Registration (GLE 2580 - Xerox Auditorium)
    • Pick up the nondisclosure agreement form
    • Pick up your badges and T-shirts!
    • Dinner
  • 6:30pm
    • Opening ceremony
    • Submission of nondisclosure form
    • Formation of extra teams for unassigned participants
  • 6:50pm
    • Reveal of the dataset
    • Brief overview of the overarching goal of the contest
  • 7:00pm 
    • Official start of DataFest@RIT 2024
    • Clear indication of all breakout rooms (Rooms with DataFest posters)
  • 8:00pm
    • Teams may begin declaring the category they will compete in
    • Tutorial 1 - Data Manipulation (Informal/short with pointers to videos)
  • 9:00pm
    • Drinks and snacks will be available
    • Tutorial 2 - Modelling (Informal/short with pointers to videos)
  • 10:00pm
    • Light entertainment (Outside the auditorium)
  • 11:00pm
    • Closing of the festival venue

Saturday, March 23rd, 2024 (Main day of the event)
From 8:00 am to 11:00pm

  • 8:00-9:30am
    • Breakfast
  • 10:00am
    • Tutorial 3 - Tips for Practical Statistical Knowledge Discovery and Predictive Analysis by Dr. Ernest Fokoué
  • 11:30am-12:30pm
    • Lunch
  • 1:00pm
    • Tutorial 4 - Tips for Efficient Coding and High Performance Computing
  • 3:00pm
    • Tutorial 5 - Tips for Writing a Compelling Final Report
  • 5:30-6:30pm
    • Dinner
  • 9:00pm
    • Deadline for declaring the category your team will compete in
  • 11:00pm
    • Announcement of the Sunday presentation schedule
    • Closing of festival venue

Sunday, March 24th, 2024 (Final day)
From 8:00 am to 4:00pm

  • 8:00-9:30am
    • Breakfast
  • 10:30am
    • Opening of the submission portal on DataFest@RIT Google form.
  • 11:00am
    • Deadline for submission of the final report
    • Encouragement to complete all slides for 1:00pm presentation 
  • 11:30am-12:30pm
    • Lunch 
  • 1:00-2:20pm
    • Parallel sessions of presentations  
    • Best Insight 
      • GLE 2580 - Xerox Auditorium
    • Best Visualization
      • GLE 2149
    • Best Use of External Data
      • GLE 2159
  • 3:00pm-4:00pm
    • Awards ceremonies with photos and videos!
    • 3:15-3:22pm
      • Presentation by Best Insight Gold
    • 3:22-3:29pm
      • Presentation by Best Visualization Gold
    • 3:29-3:36pm
      • Presentation by Best Use of External Data Gold
    • Interview of DataFesters (Photos and videos)
    • Closing remarks
  • 4:00pm
    • Cleaning and closing of DataFest venue

Registration is now closed.
 

Team Registration

Each team captain is responsible for registering their team.

Team Member Registration

Once the team is registered, the captain should encourage all members to register INDIVIDUALLY. Otherwise, the team will not be able to participate.

Mentor Registration

Graduate students who wish to help mentor contestants should register as mentors.

Faculty Registration

Local faculty members and data scientists in our community who are willing to help should register as faculty.

Student Info and Guidelines

DataFest Location

Main room: Xerox Auditorium
(located in the James E. Gleason Hall - Kate Gleason College of Engineering)

Additional Rooms: Classrooms near the Xerox Auditorium.


Supplies

  • We recommend that every team member have a desktop or laptop available for use during the competition. You might find it helpful to have a mix of PCs and Macs since they have different strengths. Before the competition, make sure the software you will be using throughout the weekend is installed correctly and running on your computer. You will be working with a large dataset, so make sure that you have enough space for it on your drive.
  • You might want to have some of your favorite statistical or computational reference books ready to be used if you have them, and bookmark the pages that you regularly use.

Large Data Advice

  • The dataset you will be working with is quite large. If you type a variable name to print it in full, it may take a long time to display. Instead, use R commands such as head(), tail(), and str() to look at the first rows, the last rows, or the structure of the data.
  • We strongly recommend you create a small data set that you can use to test things on. Then, if it works out, you can apply your procedure to the large dataset.  Some procedures can take a long time to run on large data sets, and so it will be good to know that your procedure works (because you tested it on a smaller data set) while you wait.  We recommend taking a random sample of rows from the original data set, but there might be other approaches you find useful. 
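
The advice above can be sketched in R as follows. This is only an illustration under stated assumptions: the real contest data is not available here, so full_data is a made-up stand-in, and its column names (id, value, group) and the sample size of 1,000 rows are hypothetical choices.

```r
# Fix the random seed so the same sample is drawn on every run.
set.seed(2024)

# Hypothetical stand-in for the large contest dataset.
n <- 100000
full_data <- data.frame(
  id    = seq_len(n),
  value = rnorm(n),
  group = sample(c("a", "b", "c"), n, replace = TRUE)
)

# Inspect the data without printing all rows.
head(full_data)      # first 6 rows
tail(full_data, 3)   # last 3 rows
str(full_data)       # column types and a short preview

# Draw a random sample of rows (without replacement) to test on.
sample_rows <- sample(nrow(full_data), size = 1000)
small_data  <- full_data[sample_rows, ]

# Try the procedure on the small dataset first.
fit_small <- lm(value ~ group, data = small_data)
coef(fit_small)

# Once it works, the same call can be run on the full dataset:
# fit_full <- lm(value ~ group, data = full_data)
```

Setting a seed is optional, but it lets teammates reproduce the same test sample while debugging.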

DataFest Rules

  • Before downloading the dataset you must sign the non-disclosure agreement by agreeing to the terms of use and entering your name and email address. At the end of DataFest, delete all data from thumb drives, hard drives, etc. The data is sensitive. 
  • Should members of your team drop out at the last minute, you might be asked to join another team that is also missing members. 
  • A consultant will be present at all times between 9 am and 12 midnight. Consultants are faculty, grad students, or other professionals with field-specific knowledge of the dataset, and they all have different areas of expertise. Feel free to ask anything: this is a competition, not an exam. However, do not expect the consultants to write code for you, do data management, etc. They are there to help point you in the right direction, but you're responsible for getting there on your own. The schedule of consultants will be made available at the beginning of the event.

DataFest Judging

  • Each team will have five minutes to present their findings to the judges. 
  • At some point on Friday, you might want to set aside time to think about what you want the judges to know. The five-minute time limit will be strictly enforced. At least one team member must be present for the presentation.
  • Your report must be submitted to the designated Google drive by 11 AM Sunday. The only allowed format is PDF. If using a web-based tool like Google Docs, please export to PDF and send the PDF as your submission.
  • Your slides should be ready on your own computer by 1 PM Sunday, when the parallel presentations start (Zoom links will be sent to you then). You don't need to submit your slides.
  • Awards will be given in three categories:
    • Best Insight
    • Best Visualization 
    • Best Use of External Data

Steering Committee

Kate Koch
Student and Administrative Support Specialist
School of Mathematics and Statistics
College of Science
585-475-2498

RIT School of Mathematics and Statistics Students

Zi-jia Gong
Ph.D. Student in Mathematical Modeling

Xiwen Mark
Student Worker
 

DataFest Sponsors

DATAFEST@RIT 2024 – SPONSORS

Inspiring and empowering the next generation of world-class data scientists! The Organizing Committee of the 2024 Edition of DataFest@RIT wishes to express its thanks to all past, present, and future sponsors. Without these champions of data science, we cannot continue this tradition that strengthens our undergraduate students! For a brief history of the ASA DataFest and some of the participating institutions from the past, click here.

Sponsorship Levels

Cauchy Sponsor - $5,000
1. Access to “Meet the Sponsors” Career Fair
2. Access to the Resume Book
3. Invitation to join the Email Listserv
4. Full-page ad in the DATAFEST@RIT 2024 conference main booklet/conference program
5. Large logo prominently placed on all DATAFEST@RIT 2024 banners
6. Logo prominently placed on the DATAFEST@RIT 2024 poster
7. Large logo and company link on the DATAFEST@RIT 2024 website
8. Short company profile and link on DATAFEST@RIT 2024 social media (FB, Twitter, etc.)
9. Company name displayed at the DATAFEST@RIT 2024 main conference desk during event

Pareto Sponsor - $2,500
1. Access to “Meet the Sponsors” Career Fair
2. Access to the Resume Book
3. Invitation to join the Email Listserv
4. Medium logo placed on all DATAFEST@RIT 2024 banners
5. Logo placed on the DATAFEST@RIT 2024 poster
6. Medium logo and company link on the DATAFEST@RIT 2024 website
7. Short company profile and link on DATAFEST@RIT 2024 social media (FB, Twitter, etc.)
8. Company name displayed at the DATAFEST@RIT 2024 main conference desk during event

Lognormal Sponsor - $1,000
1. Access to the Resume Book
2. Invitation to join the Email Listserv
3. Small logo placed on all DATAFEST@RIT 2024 banners
4. Logo placed on the DATAFEST@RIT 2024 poster
5. Small logo and company link on the DATAFEST@RIT 2024 website
6. Short company profile and link on DATAFEST@RIT 2024 social media (FB, Twitter, etc.)
7. Company name displayed at the DATAFEST@RIT 2024 main conference desk during event

Weibull Sponsor - $500
1. Short company profile and link on DATAFEST@RIT 2024 social media (FB, Twitter, etc.)
2. Acknowledgment of company on the DATAFEST@RIT 2024 website
3. Company name displayed at the DATAFEST@RIT 2024 main conference desk during event

Gauss Sponsor - $100
1. Acknowledgment of company on the DATAFEST@RIT 2024 website
2. Company name displayed at the DATAFEST@RIT 2024 main conference desk during event

Uniform (Individual) Sponsor - $50
1. DATAFEST@RIT 2024 Memorabilia
 

Thank you to our past DataFest@RIT Sponsors!
 

We would like to thank our generous past sponsors for UPSTAT 2011-2019. Please review the sponsorship levels above and consider a donation to this event.

• Rochester Institute of Technology
• University of Rochester
• Praxair
• Xerox
• iCitizen
• Rochester Data Science Consortium
• Harris Corporation
• Wegmans
• Conduent
• Center for Quality and Applied Statistics (CQAS)
• JMP Statistical Discovery
• American Statistical Association (ASA)
• WITR 89.7
• M&T Bank
• Corning

Lognormal Sponsors:

  • Rochester Data Science Consortium
  • Wegmans
  • The American Statistical Association
  • Harris Corporation
  • M&T Bank

Weibull Sponsors:

  • Corning
  • CQAS @ RIT
  • UR Dept of Biostatistics and Computational Biology

DataFest Winners

Winners to be announced.

Data Science Jobs in High Demand


Sean Lahman of the Rochester Democrat & Chronicle talks with RIT Professor Ernest Fokoué about data science and the school's DataFest (March 23, 2017).