Exercises - Recap of Python

Let us perform few exercises to understand how to process the data. We will use LinkedIn data to perform some basic data processing using Python.

  • Get LinkedIn archive.

    • Go to https://linkedin.com

    • Me on top -> Settings & Privacy

    • Then go to “How LinkedIn users your data” -> Getting a copy of your data

    • Register and download. You will get a link as part of the email.

  • Data contain multiple CSV files. We will limit the analysis to Contacts.csv and Connections.csv.

  • Get the number of contacts with out email ids.

  • Get the number of contacts from each source.

  • Get the number of connections with each title.

  • Get the number of connections from each company.

  • Get the number of contacts for each month in the year 2018.

  • Use Postgres or MySQL as databases (you can setup in your laptop) and write connections data to the database