PYTHON FOR DATA SCIENCE at GA

About This Workshop

Python is a versatile, widely used programming language with many applications. This workshop explores Python's place in the scientific ecosystem and how the language, together with several readily available open-source libraries, can serve as a powerful tool for data analysis. Designed as a stand-alone introduction to the data science aspects of Python, this class is also a recommended preparatory workshop for students planning to enroll in General Assembly's Data Science course. We will also cover topics from probability, statistics, regression, and classification.

By the end of the workshop, students will be able to:

  • Use pandas to perform data analysis
  • Use matplotlib to make visualizations of the data
  • Use scikit-learn to generate a statistical model of the data
  • Have a working understanding of the statistical models being used

Takeaways

  • Ability to use Python for basic data analysis tasks (data cleaning and reformatting, exploration, analysis)
  • Understanding of Python’s role as a tool in the backend and analysis toolkits
  • Awareness of the possibilities that a deeper knowledge of Python opens up
  • Greater preparedness for GA's Data Science course

Preparation

  • Students must have taken Python Programming 101 or have equivalent foundational knowledge. Note: we will NOT cover introductory Python. Here is an example exercise you should be able to complete before taking this class: write a function that prints all of the numbers between 1 and 10,000 that are divisible by either 7 or 33, but not both.
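
For reference, here is one possible way to approach the example exercise. It is a minimal sketch, not an official solution: the function names and default arguments are our own, and we assume "between 1 and 10,000" includes both endpoints (neither endpoint is divisible by 7 or 33, so this choice does not change the result).

```python
def divisible_by_exactly_one(start=1, stop=10_000, a=7, b=33):
    """Return the numbers in [start, stop] divisible by a or b, but not both."""
    # Comparing the two divisibility tests with != acts as an exclusive or:
    # it is True only when exactly one of the tests passes.
    return [n for n in range(start, stop + 1) if (n % a == 0) != (n % b == 0)]


def print_matches():
    """Print each matching number on its own line, as the exercise asks."""
    for n in divisible_by_exactly_one():
        print(n)


if __name__ == "__main__":
    print_matches()
```

Separating the selection logic from the printing makes the result easy to check, for example by confirming that 231 (which is 7 × 33) is not in the returned list.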

Please bring a laptop with Anaconda installed.

Anaconda Installation Instructions:

Visit https://www.anaconda.com/download/ and download the latest 3.x version. When the download is complete, run the installer.
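
After installing, you may want to confirm that the libraries named in the workshop description are available. The short script below is one way to check, assuming you run it with the Python interpreter that Anaconda installed. The package list and the function name are our own choices for illustration; note that scikit-learn is imported under the name `sklearn`.

```python
import importlib.util

# Import names for the libraries mentioned in the workshop description.
# (Assumption: scikit-learn is checked under its import name, "sklearn".)
REQUIRED = ("pandas", "matplotlib", "sklearn")


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found in the current environment."""
    # find_spec returns None when a top-level module cannot be located.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All workshop libraries found.")
```

If anything is reported as missing, the script was probably run with a different Python installation than the one Anaconda provides.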

For more information on this event, please visit the General Assembly site.