Data Science angry rant 

I really don't understand why tech-savvy people are still using Excel files to store data. For fuck's sake, please stop! Excel is SHIT and it messes with your data. It changes text to dates, and it will also fuck up the month and day order in a file.

There is this "data manager" dude who took 7 months to give me the data, and now I have to put up with all his shit (non-ASCII characters, random whitespace, random capital letters, ...) and also put up with Excel's shit.
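
For anyone stuck with the same kind of file, the cleanup described above can be sketched in a few lines of Python. This is only a minimal sketch: the function name `clean_cell` is made up for illustration, and it assumes that dropping accents and lower-casing everything is acceptable for your data, which is lossy and will not be true for every dataset.

```python
import unicodedata


def clean_cell(value: str) -> str:
    """Normalize one text cell (hypothetical helper, lossy by design)."""
    # Decompose accented characters (e.g. "ü" becomes "u" plus a combining mark)
    # and turn compatibility characters such as non-breaking spaces into plain ones.
    text = unicodedata.normalize("NFKD", value)
    # Drop everything that is still not ASCII (assumption: ASCII-only output is wanted).
    text = text.encode("ascii", "ignore").decode("ascii")
    # Collapse runs of whitespace and strip leading/trailing whitespace.
    text = " ".join(text.split())
    # Remove the random capital letters.
    return text.lower()


print(clean_cell("  Jürgen   MÜLLER\u00a0 "))
```

If the non-ASCII characters are legitimate (names, units), skip the `encode`/`decode` step and keep only the whitespace and case handling.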

Seriously, if you are a data broker or data manager or data scientist, and when you hear "data" or "database" you immediately think of Excel files, you need to be hospitalized. You either have a severe cognitive disorder or you are extremely sadistic. Either way you are a danger to society and you have to be put away.

@Mehrad
Databases are tables, and the best way to visualize a database is with a spreadsheet. So if someone who is not a computer crack needs to handle data and wants to see what is happening, the spreadsheet is the natural choice for that.

@rudolf Please note that I am talking about storing data and not the medium with which you look at the data.

I use LibreOffice Calc all the time to go through my CSV files, but I NEVER store data in a spreadsheet format.
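
To make the storing-versus-viewing distinction concrete, here is a rough sketch of keeping data as plain CSV text with Python's standard `csv` module. The column names and sample values are invented for illustration, and an in-memory buffer stands in for a real file. Dates are written as ISO 8601 strings, so the month and day order is unambiguous, and everything comes back as text unless you convert it yourself.

```python
import csv
import io
from datetime import date

# Made-up example rows: IDs that a spreadsheet might "helpfully" reinterpret.
rows = [
    {"sample_id": "MARCH1", "collected_on": date(2019, 3, 4).isoformat()},
    {"sample_id": "0042", "collected_on": date(2019, 12, 1).isoformat()},
]

# Write the rows as CSV (an in-memory buffer stands in for a file on disk).
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["sample_id", "collected_on"])
writer.writeheader()
writer.writerows(rows)

# Read them back: every value is returned as a plain string, untouched.
buffer.seek(0)
loaded = list(csv.DictReader(buffer))

# Convert to a real date only where you explicitly ask for it.
first_date = date.fromisoformat(loaded[0]["collected_on"])
print(loaded[0]["sample_id"], loaded[1]["sample_id"], first_date)
```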

Also as a side-note, no, not all databases are tables.

@Mehrad @rudolf Out of interest, what do you mean by "not all databases are tables"?

@hexn There are also graph databases like Neo4j, which are widely used in some domains. But perhaps you can check out this Wikipedia article to learn about the 4 different types of non-relational databases, along with examples:

en.wikipedia.org/wiki/NoSQL

@rudolf

@Mehrad @hexn
People using non-relational databases for data can also go and trade in their papers. The good thing about databases is that I can get and present the data how I want, not just how the person setting up the database thought of it.
@rudolf

@sotolf

> I can get and present the data how I want

I bet this is what Microsoft people said when they were changing Excel's reference date! Yes, some employee of Microsoft at some point convinced others to change the reference date used to calculate dates from 1900-01-01 to 1904-01-01, and now, many years down the line, I just have 35548 and I have to work out which Excel version the a-hole who put together the data used!
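
For reference, Excel workbooks can use either a 1900 or a 1904 date system, and the same serial number lands about four years apart depending on which one applies. A minimal sketch of the conversion, where the function name `excel_serial_to_date` and its `date1904` flag are illustrative rather than any library's API:

```python
from datetime import date, timedelta

# 1900 system: Excel treats 1900 as a leap year, so for serials after the
# fictitious 1900-02-29 (serial 60) the effective day zero is 1899-12-30.
EPOCH_1900 = date(1899, 12, 30)
# 1904 system: serial 0 is 1904-01-01.
EPOCH_1904 = date(1904, 1, 1)


def excel_serial_to_date(serial: int, date1904: bool = False) -> date:
    """Convert a whole-day Excel serial to a date (1900 mode assumes serial >= 61)."""
    epoch = EPOCH_1904 if date1904 else EPOCH_1900
    return epoch + timedelta(days=serial)


print(excel_serial_to_date(35548))                 # 1997-04-28
print(excel_serial_to_date(35548, date1904=True))  # 2001-04-29
```

The two results differ by 1462 days, so a sanity check against any date you know to be in the data is usually enough to tell which system the file was saved with.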

tknk.io/WQFl

@hexn @rudolf

@Mehrad @hexn @rudolf Ehm, I hate Excel as well. Give me a relational database and I can give you almost anything you want, but having to muck around with parsing trees and MapReduce to get something that is simple in an RDBMS is kind of wasted effort in my view.

@splatt9990 This is almost what happened to me once. I got an email with a graph inside and I was asked to calculate some correlation or something similar. So I found this amazing tool to "convert" the image to a table:

markummitchell.github.io/engau
