Exercises - Recap of Python
Let us perform a few exercises to understand how to process data. We will use LinkedIn data to perform some basic data processing using Python.
Get LinkedIn archive.
Go to https://linkedin.com
Me on top -> Settings & Privacy
Then go to “How LinkedIn uses your data” -> Getting a copy of your data
Request the archive and download it. You will receive the download link by email.
The archive contains multiple CSV files. We will limit the analysis to Contacts.csv and Connections.csv.
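As a starting point, here is a minimal sketch of loading one of these files with the standard library, assuming the file begins with a header row. The helper name `read_rows` and the sample column names are hypothetical; check the header of your own export, since the actual column names may differ.

```python
import csv
import io


def read_rows(file_obj):
    """Return the rows of a CSV file as a list of dicts keyed by the header."""
    return list(csv.DictReader(file_obj))


# Hypothetical sample standing in for Connections.csv; real columns may differ.
sample = io.StringIO(
    "First Name,Last Name,Company,Position\n"
    "Asha,Rao,Acme,Data Engineer\n"
    "Ben,Lee,Globex,Analyst\n"
)
connections = read_rows(sample)
print(len(connections), connections[0]["Company"])
```

With a real file, the same helper can be called as `read_rows(open("Connections.csv", newline="", encoding="utf-8"))`, ideally inside a `with` block so the file is closed afterwards.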
Get the number of contacts without email ids.
Get the number of contacts from each source.
Get the number of connections with each title.
Get the number of connections from each company.
Get the number of contacts for each month in the year 2018.
Use Postgres or MySQL as the database (you can set it up on your laptop) and write the connections data to the database.
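One possible way to approach the exercises above is sketched below using only the standard library. Everything in it is an assumption made for illustration: the column names (`Source`, `Emails`, `CreatedAt`, `Company`, `Position`), the date format, the sample rows, and the helper names are hypothetical and should be adapted to your own export. The database step uses an in-memory SQLite database as a stand-in so the sketch runs without a server; it is not a Postgres or MySQL connection.

```python
import sqlite3
from collections import Counter
from datetime import datetime

# Hypothetical rows; column names and the date format are assumptions.
contacts = [
    {"Source": "GMAIL", "Emails": "a@example.com", "CreatedAt": "2018-01-15"},
    {"Source": "GMAIL", "Emails": "", "CreatedAt": "2018-01-20"},
    {"Source": "PHONE", "Emails": "", "CreatedAt": "2018-03-02"},
    {"Source": "PHONE", "Emails": "b@example.com", "CreatedAt": "2017-12-30"},
]
connections = [
    {"Company": "Acme", "Position": "Data Engineer"},
    {"Company": "Acme", "Position": "Analyst"},
    {"Company": "Globex", "Position": "Analyst"},
]


def count_missing(rows, column):
    """Count rows whose value in `column` is absent or blank."""
    return sum(1 for row in rows if not (row.get(column) or "").strip())


def count_by(rows, column):
    """Count rows for each distinct value of `column`."""
    return Counter(row[column] for row in rows)


def count_by_month(rows, column, year, date_format="%Y-%m-%d"):
    """Count rows per month number, keeping only dates in `year`."""
    months = Counter()
    for row in rows:
        created = datetime.strptime(row[column], date_format)
        if created.year == year:
            months[created.month] += 1
    return months


def write_connections(conn, rows):
    """Create the table if needed and insert one record per connection."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS connections (company TEXT, position TEXT)"
    )
    conn.executemany(
        "INSERT INTO connections (company, position) VALUES (?, ?)",
        [(row["Company"], row["Position"]) for row in rows],
    )
    conn.commit()


missing_emails = count_missing(contacts, "Emails")
by_source = count_by(contacts, "Source")
by_title = count_by(connections, "Position")
by_company = count_by(connections, "Company")
by_month_2018 = count_by_month(contacts, "CreatedAt", 2018)

# SQLite stand-in for the Postgres/MySQL step.
db = sqlite3.connect(":memory:")
write_connections(db, connections)
stored = db.execute("SELECT COUNT(*) FROM connections").fetchone()[0]
```

When moving to Postgres or MySQL, the overall shape can stay the same, but the connection comes from a driver for that database, and common drivers such as psycopg2 and PyMySQL use `%s` rather than `?` as the parameter placeholder.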