Publications by tomaztsql

Advent of 2021, Day 5 – Setting up Spark Cluster

05.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode. We have explored the Spark architecture and looked into the differences between local and cluster mode. So, if you navigate to your local installation ...

2916 sym R (452 sym/8 pcs) 12 img

Advent of 2021, Day 6 – Setting up IDE

06.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster. Let’s look into the IDE that can be used to run Spark. Remember that Spark can be used with languages: Scala, Jav...

1497 sym R (571 sym/3 pcs) 6 img

Advent of 2021, Day 7 – Starting Spark with R and Python

07.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE. Let’s look into the local use of Spark. For the R language, the sparklyr package is available and ...

2214 sym R (838 sym/9 pcs) 8 img

Advent of 2021, Day 9 – RDD Operations

09.12.2021

Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files. Two types of operations are available with RDD; tra...

2792 sym R (380 sym/3 pcs) 10 img

Advent of 2021, Day 10 – Working with data frames

10.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operation...

2446 sym R (1506 sym/7 pcs) 2 img

Advent of 2021, Day 11 – Working with packages and spark dataFrames

11.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operatio...

2323 sym R (1185 sym/7 pcs) 2 img

Advent of 2021, Day 12 – Spark SQL

12.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operatio...

2346 sym R (1605 sym/6 pcs) 2 img

Advent of 2021, Day 13 – Spark SQL bucketing and partitioning

13.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operatio...

3674 sym R (956 sym/5 pcs) 2 img

Advent of 2021, Day 14 – Introduction to Spark Streaming

15.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operatio...

3229 sym R (1941 sym/7 pcs) 2 img

Advent of 2021, Day 16 – Dataframe operations for Spark streaming

16.12.2021

Series of Apache Spark posts: Dec 01: What is Apache Spark; Dec 02: Installing Apache Spark; Dec 03: Getting around CLI and WEB UI in Apache Spark; Dec 04: Spark Architecture – Local and cluster mode; Dec 05: Setting up Spark Cluster; Dec 06: Setting up IDE; Dec 07: Starting Spark with R and Python; Dec 08: Creating RDD files; Dec 09: RDD Operatio...

3574 sym R (1464 sym/8 pcs) 2 img