Member-only story
Data is the new oil…yes, but how to store this oil?
Data is the new oil…
This is one phrase we read in almost every data analysis article on the web. However, not very often we read about the toolset needed to build a database and to store these data. It is as if we take for granted the storing step and center our attention on modeling data and building algorithms.
Although there are tons of complex and expensive software and hardware to perform this kind of task massively, for most small to medium projects people usually rely on Excel or CSV files. But as our collection of data and the analysis requirements increases could challenge our computer’s capabilities and software packages as MS Excel to do the job.
Here is when SQL and Python could help us handle the task.
I will be using for this article os, sqlite3 and pandas python libraries. They are very common libraries, so, you shouldn’t have any problem importing them or downloading them with pip.
We are going to explore the basic concepts, but still, by the end of this article we are going to be able to:
- Build a database from scratch
- Build a database importing an…