I recently had the opportunity to work with a large dataset (roughly 1 TB) and filter it in Python based on a set of given parameters. In this post, I’ll sum up how I accomplished the task and the measures I used to make the filtering process efficient.
Task Details
We have a large number of gzip files...
Even stronger than the Modi wave in the 2014 Lok Sabha elections is the drive of every second- and third-year student at KGP to go into data analytics; and, sadly, unlike the Modi wave, it is here to stay for decades to come.
As a field, data analytics has seen a bubble no smaller than the one the Internet saw in...