Pandas

What is Pandas?

  • Pandas is a Python library used for working with data sets.

  • It has functions for analyzing, cleaning, exploring, and manipulating data.

  • The name Pandas refers to both "Panel Data" and "Python Data Analysis". The library was created by Wes McKinney in 2008.
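
As a rough illustration of the four activities listed above, the following minimal sketch explores, analyzes, cleans, and manipulates a tiny table. The column names and values are hypothetical and made up for this example only.

```python
import pandas as pd

# Hypothetical sample data, invented purely for illustration.
df = pd.DataFrame({
    "name": ["Ann", "Bob", "Cara", "Dan"],
    "age": [28, 35, None, 41],
})

# Explore: how many rows and columns are there?
shape = df.shape

# Analyze: average age (missing values are skipped by default).
mean_age = df["age"].mean()

# Clean: drop the row whose age is missing.
clean = df.dropna(subset=["age"])

# Manipulate: add a derived column.
clean = clean.assign(age_next_year=clean["age"] + 1)

print(shape)
print(mean_age)
print(clean)
```

Each step returns a new object rather than changing `df` in place, so the original data stays available for comparison.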

info

The source code for Pandas is hosted on GitHub at https://github.com/pandas-dev/pandas

Why Use Pandas?

  • Pandas allows us to analyze large data sets and draw conclusions based on statistical theory.

  • Pandas can clean messy data sets and make them readable and relevant.

  • Relevant data is very important in data science.
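
To make "cleaning messy data" more concrete, here is a small sketch under stated assumptions: the table, its column names (`city`, `temp_c`), and its values are hypothetical. It trims stray whitespace, normalizes capitalization, converts text to numbers, and removes duplicate rows.

```python
import pandas as pd

# Hypothetical messy data: stray spaces, mixed case, numbers stored
# as text, a duplicate row, and a missing city.
messy = pd.DataFrame({
    "city": [" Paris", "paris ", "Berlin", "Berlin", None],
    "temp_c": ["21", "21", "18", "18", "25"],
})

tidy = (
    messy
    .dropna(subset=["city"])                      # remove rows without a city
    .assign(
        city=lambda d: d["city"].str.strip().str.title(),  # " Paris" -> "Paris"
        temp_c=lambda d: pd.to_numeric(d["temp_c"]),       # "21" -> 21
    )
    .drop_duplicates()                            # keep one row per (city, temp_c)
    .reset_index(drop=True)
)

print(tidy)
```

Which cleaning steps are appropriate depends on the data; this is one possible sequence, not a general recipe.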

note

Data Science is a branch of computer science that studies how to store, use, and analyze data in order to derive information from it.

Installation

Open a command prompt (cmd) and type:

pip install pandas

Importing Pandas

Once Pandas is installed, import it into your application with the import statement:

Hello.py
import pandas
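
Pandas is commonly imported under the short alias `pd`. The sketch below, which uses a small made-up Series, suggests that both forms refer to the same module, so the alias only saves typing.

```python
import pandas
import pandas as pd

# Access the library through its full name...
s = pandas.Series([1, 2, 3])

# ...or through the alias; both names point at the same module object.
print(pd is pandas)  # True
print(s.sum())       # 6
```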

Checking Pandas Version

The version string is stored in the __version__ attribute.

HelloPandas.py
import pandas as pd
print(pd.__version__)