How to Parse and Analyze CSV Data in Python

May 9, 2025 less than 1 minute read

Introduction

CSV (Comma-Separated Values) files are common in data science. Using simple string operations and lists/dictionaries, I processed .csv files like iris.csv and mpg.csv to extract useful insights.

Reading and Parsing a CSV File

file = open("iris.csv", "r")
lines = file.readlines()
file.close()

for line in lines[1:]:
    values = line.strip().split(",")
    print(values)

Converting to Dictionary for Easier Use

irisDict = {
    "sepal_length": values[0],
    "sepal_width": values[1],
    "petal_length": values[2],
    "petal_width": values[3],
    "species": values[4]
}

This structure makes it easier to calculate averages or filter data.

What I learned

.readlines() + .strip().split(",") is a basic but powerful way to parse CSVs.
Dictionaries are useful for organizing row data by column names.
Loops help calculate statistics like averages or category counts.

What I want to do next

Try using the built-in csv module for more robust handling.
Apply filtering and grouping logic to larger datasets.

Share on

X Facebook LinkedIn Bluesky

Oracle Free Tier Limitations: Regional Resource Exhaustion and Deployment Dilemmas

June 24, 2025 1 minute read

Analyzing the practical issues of using Oracle Cloud Free Tier in Korea—especially the challenge of regional resource shortages that block new VM deployments.

Is Using AI to Write Code Helping or Hurting My Long-Term Growth?

June 23, 2025 2 minute read

Reflecting on whether relying on AI tools like ChatGPT or Claude for coding helps deepen understanding or hinders the development of true programming skills.

Why Feature Engineering and Domain Knowledge Outperform Fancy Models

June 21, 2025 2 minute read

This post highlights why feature selection and domain knowledge matter more than complex models, especially when building real-world ML solutions.

What is LLM Fine-Tuning? Making the Model Speak Your Language

June 20, 2025 1 minute read

This post explains the concept of fine-tuning large language models (LLMs) from a practical perspective, focusing on shaping model outputs through diverse an...

Zeu Park