Exercises - Recap of Python
Let us work through a few exercises to understand how to process data. We will use LinkedIn data to perform some basic data processing using Python.
- Get the LinkedIn archive.
  - Go to https://linkedin.com
  - Click Me at the top -> Settings & Privacy
  - Then go to “How LinkedIn uses your data” -> Getting a copy of your data
  - Submit the request and download the archive. You will get the download link by email.
 
- The archive contains multiple CSV files. We will limit the analysis to Contacts.csv and Connections.csv.
- Get the number of contacts without email IDs.
- Get the number of contacts from each source. 
- Get the number of connections with each title. 
- Get the number of connections from each company. 
- Get the number of contacts for each month in the year 2018. 
- Use Postgres or MySQL as the database (you can set it up on your laptop) and write the connections data to the database.
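
As a starting point for the counting exercises above, here is a minimal sketch using only the Python standard library. It is not a full solution. The column names (`Source`, `Emails`, `CreatedAt`), the date format, and the sample rows are assumptions made up for illustration, so check the header row of your own export and adjust them.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Hypothetical sample standing in for Contacts.csv. The column names and
# the date format are assumptions; check the header of your own export.
SAMPLE_CONTACTS = """Source,FirstName,Emails,CreatedAt
GMAIL,Asha,asha@example.com,2018-01-15
GMAIL,Ben,,2018-01-20
PHONE,Chen,,2018-03-02
PHONE,Dana,dana@example.com,2019-05-09
"""


def read_rows(handle):
    """Read a CSV file object into a list of dicts keyed by header."""
    return list(csv.DictReader(handle))


def count_missing(rows, column):
    """Count rows whose value in `column` is empty or only whitespace."""
    return sum(1 for row in rows if not (row.get(column) or "").strip())


def count_by(rows, column):
    """Count rows per distinct value of `column`."""
    return Counter(row[column] for row in rows)


def count_by_month(rows, column, year, fmt="%Y-%m-%d"):
    """Count rows per month number, keeping only dates in `year`."""
    dates = (datetime.strptime(row[column], fmt) for row in rows)
    return Counter(d.month for d in dates if d.year == year)


contacts = read_rows(io.StringIO(SAMPLE_CONTACTS))
missing = count_missing(contacts, "Emails")
by_source = count_by(contacts, "Source")
by_month_2018 = count_by_month(contacts, "CreatedAt", 2018)
print(missing, dict(by_source), dict(by_month_2018))
```

To run this against the real archive, replace the `io.StringIO(...)` call with a file opened as `open("Contacts.csv", newline="", encoding="utf-8")`. The same `count_by` helper can be reused for the title and company counts on Connections.csv once you know the names of those columns.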