this post was submitted on 08 Jul 2023
-12 points (33.3% liked)

Python

6412 readers
6 users here now

Welcome to the Python community on the programming.dev Lemmy instance!

πŸ“… Events

PastNovember 2023

October 2023

July 2023

August 2023

September 2023

🐍 Python project:
πŸ’“ Python Community:
✨ Python Ecosystem:
🌌 Fediverse
Communities
Projects
Feeds

founded 1 year ago
MODERATORS
 

Are you doing data science? Statistics? No?

Then for god's sake don't use pandas, you just look dumb af when you pull several MB of a package just to load csv. If you find yourself doing that, just stop programming and look for another job

Thanks for attention

all 8 comments
sorted by: hot top controversial new old
[–] _danny@lemmy.world 27 points 1 year ago (1 children)

If you find yourself doing that, just stop programming and look for another job

I don't think that is an appropriate response to someone misunderstanding a package. Just educate them in a kind and respectful manner and they'll learn.

[–] toikpi 6 points 1 year ago (1 children)

For those that don't know, the standard library includes a csv package so you can just import csv.

Documented at https://docs.python.org/3/library/csv.html

[–] muppetjones@lemm.ee 5 points 1 year ago

I use DictReader all the time, along with a library to hande the type detection. This is the way to go, especially if you need to process line by line or filter columns and rows out first.

Regardless, I'll avoid pandas wherever I can. It's not something I want in production level code if I can help it.

[–] ActuallyRuben@actuallyruben.nl 2 points 1 year ago (1 children)

Wait what, people are loading data from csv in their websites?

[–] Michal@discuss.tchncs.de 2 points 1 year ago

Makes sense if you want to give the user the ability to import data from csv (see django import export package). Beyond importing data from user or another service i dont see other uses, but they do exist.

[–] Michal@discuss.tchncs.de 1 points 1 year ago (1 children)

Pandas is more efficient than Python at operating on large datasets. Can you suggest alternarives?