How to transform data types in pandas

Question

QA Hub Editorial · Accepted Answer

Short answer

Correct data types reduce memory usage, enable proper operations, and prevent subtle bugs caused by implicit type coercion.

Inspect current types using dtypes and identify columns that need conversion.
Convert numeric strings to integers or floats using pd.to_numeric with error handling.
Parse date strings into datetime objects using pd.to_datetime with format specification.
Convert low-cardinality object columns to category type for memory efficiency.
Validate conversions by checking dtypes again and sampling values.

import pandas as pd

df = pd.read_csv('data.csv')
df = df.dropna().drop_duplicates()
df['date'] = pd.to_datetime(df['date'])
print(df.head())

This example shows how to load a CSV, remove missing values and duplicates, and parse dates in just a few lines of pandas code.