Data analysis with Python

Why Python for data analysis?

  • what value the analyst extracts from the data
  • what the analyst wants to get from the data analysis
  • whether the analyst can make the right decision

The process of data analysis

  1. data collection (from open data sources, crawling …)
  2. data exploration (EDA methods with Python)
  3. data preprocessing (using the raw data as-is or applying feature engineering)
  4. data modeling and feedback
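
The four steps above can be sketched on a tiny in-memory dataset. This is only an illustration under assumptions: the CSV text, the column names (hours, score) and the function names are made up for this example, and it uses the standard library alone so that it runs without extra installs. In practice a library such as pandas would usually take over most of this work.

```python
import csv
import io
import statistics

# Hypothetical raw data; a real project would collect this from an
# open dataset or a crawler.
RAW_CSV = """hours,score
1,52
2,58
3,
4,71
5,75
"""


def collect(text):
    """Step 1, data collection: parse CSV text into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text)))


def explore(rows):
    """Step 2, data exploration: count rows and missing values per column."""
    missing = {col: sum(1 for r in rows if r[col] == "") for col in rows[0]}
    return {"n_rows": len(rows), "missing": missing}


def preprocess(rows):
    """Step 3, data preprocessing: drop incomplete rows, convert to floats."""
    return [
        (float(r["hours"]), float(r["score"]))
        for r in rows
        if all(value != "" for value in r.values())
    ]


def fit_line(pairs):
    """Step 4, data modeling: least-squares line score = a * hours + b."""
    xs = [x for x, _ in pairs]
    ys = [y for _, y in pairs]
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    a = sum((x - mean_x) * (y - mean_y) for x, y in pairs) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    b = mean_y - a * mean_x
    return a, b


rows = collect(RAW_CSV)
summary = explore(rows)
clean = preprocess(rows)
slope, intercept = fit_line(clean)

print(summary)
print(f"score ~ {slope:.2f} * hours + {intercept:.2f}")
```

The feedback part of step 4 would mean checking how well the fitted line matches the data and, if it does not, going back to an earlier step (for example collecting more data or preprocessing it differently).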

The tools for data analysis

  • The most important thing: use the right tool for the purpose of the data analysis
  • You need to know why you are analyzing the data and which functions are required

Python

  • A programming language (a specific language for giving commands to a computer); an interpreted language (code is executed line by line)
  • A means of communication between a human and a computer
  • readability: code is easy to read and understand
  • open source: many libraries and modules are available (e.g. numpy, pandas …)
  • automatic memory management
  • variables in Python code = named space for storing data
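
A few of the points above can be seen in a short snippet. It is a minimal sketch: the variable names are arbitrary, and the standard-library module statistics stands in here for third-party libraries such as numpy or pandas, which are imported in the same way.

```python
import statistics  # modules and libraries are loaded with `import`

# Statements run from top to bottom, one at a time.
scores = [70, 85, 90]                # the name `scores` refers to a list
average = statistics.mean(scores)    # call a function from the module

# A variable is a name for stored data; it can be pointed at new data.
backup = scores                      # both names refer to the same list
scores = scores + [100]              # builds a new list, rebinds `scores`

# No manual allocation or freeing is needed: once nothing refers to an
# object any more, Python reclaims its memory automatically.
del backup

print(average)
print(scores)
```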