Distributed Computing with Spark SQL

Course Feature
  • Cost
    Free
  • Provider
    Coursera
  • Certificate
    Paid Certification
  • Language
    English
  • Start Date
    No Information
  • Learners
    No Information
  • Duration
    No Information
  • Instructor
    Learn SQL Basics for Data Science Specialization
Next Course
2.5
0 Ratings
This course is all about big data and distributed computing using Apache Spark. It is designed for students with SQL experience who want to take the next step on their data journey. Through four modules, students will gain a thorough understanding of the Spark architecture, queries within Spark, common ways to optimize Spark SQL, and how to build reliable data pipelines. They will also learn about storage vs. compute, caching, partitions, and troubleshooting performance issues via the Spark UI. Additionally, students will explore new features in Apache Spark 3.x such as Adaptive Query Execution, connecting to databases, schemas and data types, file formats, and writing reliable data. Finally, they will learn about data lakes, data warehouses, and lakehouses, and build production grade data pipelines by combining Spark with the open-source project Delta Lake. By the end of this course, students will have honed their SQL and distributed computing skills to become more adept at advanced analysis.
Show All
Course Overview

❗The content presented here is sourced directly from Coursera platform. For comprehensive course details, including enrollment information, simply click on the 'Go to class' link on our website.

Updated in [May 30th, 2023]

Introducing Distributed Computing with Spark SQL:

Are you looking to take your data journey to the next level? Distributed Computing with Spark SQL is the perfect course for you! This course is designed to help students with SQL experience gain a thorough understanding of Apache Spark, an open-source standard for working with large datasets. Through four modules, you will learn the fundamentals of data analysis using SQL on Spark, setting the foundation for how to combine data with advanced analytics at scale and in production environments. You will also gain an understanding of the Spark architecture, queries within Spark, common ways to optimize Spark SQL, and how to build reliable data pipelines.

By taking this course, you will be able to hone your SQL and distributed computing skills to become more adept at advanced analysis and to set the stage for transitioning to more advanced analytics as Data Scientists. This course is also a great way to explore possible development paths in your career or education, as well as related learning suggestions. So, if you’re ready to take your data journey to the next level, Distributed Computing with Spark SQL is the perfect course for you!

Show All
Recommended Courses
free introduction-to-transact-sql-16127
Introduction to Transact SQL
4.5
Udemy 1 learners
Learn More
Explore the essentials of Introduction to Transact SQL
free implementing-etl-with-sql-server-integration-services-16128
Implementing ETL with SQL Server Integration Services
2.0
Edx 278 learners
Learn More
Gain an introduction to Implementing ETL with SQL Server Integration Services
free sql-tutorial-full-database-course-for-beginners-16129
SQL Tutorial - Full Database Course for Beginners
5.0
freeCodeCamp 13 learners
Learn More
This SQL Tutorial - Full Database Course for Beginners is an online course designed to teach beginners the basics of SQL. It covers topics such as what a database is, tables and keys, SQL basics, MySQL installation, creating tables, inserting data, constraints, updating and deleting data, basic queries, company database introduction, creating a company database, more basic queries, functions, wildcards, union, joins, nested queries, on delete, triggers, ER diagrams introduction, designing an ER diagram, and converting ER diagrams to schemas. This course is comprehensive and provides a great introduction to SQL for beginners.
free beginners-introduction-to-sql-and-databases-16130
Beginners Introduction to SQL and Databases
4.5
Udemy 3 learners
Learn More
This online course is a great introduction to SQL and databases for beginners. It covers the basics of SQL and databases, how to connect to MySQL, and an introduction to MySQL. It is a comprehensive course that will help students understand the fundamentals of SQL and databases. It is well-structured and easy to follow, making it a great starting point for those new to the subject. The course is also suitable for those who want to refresh their knowledge of SQL and databases. Overall, this is an excellent course for anyone looking to learn the basics of SQL and databases.
Favorites (0)
Favorites
0 favorite option

You have no favorites

Name delet
arrow Click Allow to get free Distributed Computing with Spark SQL courses!