Publications by tomaztsql

Advent of 2021, Day 7 – Starting Spark with R and Python

07.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDE Let’s look into the local use of Spark. For R language, sparklyr package is availble and ...

2214 sym R (838 sym/9 pcs) 8 img

Advent of 2021, Day 9 – RDD Operations

09.12.2021

Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD files Two types of operations are available with RDD; tra...

2792 sym R (380 sym/3 pcs) 10 img

Advent of 2021, Day 10 – Working with data frames

10.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operation...

2446 sym R (1506 sym/7 pcs) 2 img

Advent of 2021, Day 11 – Working with packages and spark dataFrames

11.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

2323 sym R (1185 sym/7 pcs) 2 img

Advent of 2021, Day 12 – Spark SQL

12.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

2346 sym R (1605 sym/6 pcs) 2 img

Advent of 2021, Day 13 – Spark SQL bucketing and partitioning

13.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

3674 sym R (956 sym/5 pcs) 2 img

Advent of 2021, Day 14 – Introduction to Spark Streaming

15.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

3229 sym R (1941 sym/7 pcs) 2 img

Advent of 2021, Day 16 – Dataframe operations for Spark streaming

16.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

3574 sym R (1464 sym/8 pcs) 2 img

Advent of 2021, Day 17 – Watermarking and joins for Spark streaming

17.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

5224 sym R (852 sym/4 pcs) 6 img

Advent of 2021, Day 19 – Data engineering for Spark Streaming

19.12.2021

Series of Apache Spark posts: Dec 01: What is Apache SparkDec 02: Installing Apache SparkDec 03: Getting around CLI and WEB UI in Apache SparkDec 04: Spark Architecture – Local and cluster modeDec 05: Setting up Spark ClusterDec 06: Setting up IDEDec 07: Starting Spark with R and PythonDec 08: Creating RDD filesDec 09: RDD Operatio...

6692 sym R (3911 sym/11 pcs) 6 img