Processing Column DataΒΆ
As part of this module we will explore the functions available under org.apache.spark.sql.functions
to derive new values from existing column values with in a Data Frame.
Pre-defined Functions
Create Dummy Data Frame
Categories of Functions
Special Functions - col and lit
String Manipulation Functions - 1
String Manipulation Functions - 2
Date and Time Overview
Date and Time Arithmetic
Date and Time - trunc and date_trunc
Date and Time - Extracting Information
Dealing with Unix Timestamp
Example - Word Count
Conclusion