Introduction to Azure Databricks using R


Course Number: AZDB-142
Duration: 2 days (13 hours)
Format: Live, hands-on

Azure Databricks using R Training Overview

This in-person or online Introduction to Azure Databricks using R training course teaches attendees how to scale R applications for complex analytics and data science operations on Azure Databricks, Microsoft’s cloud-based Apache Spark platform. The class is hands-on and can be customized to your team's goals and needs.

Location and Pricing

This course is taught privately, in person or online, for teams of 3 or more. To receive a quote for online corporate training, please contact us.

In addition, some courses are available as live, instructor-led training from one of our partners.

Objectives

  • Understand what Databricks is and how it is architected.
  • Work in the Databricks environment.
  • Learn what Spark SQL is and how to write Spark applications with it.
  • Understand the concepts behind running R on Spark.
  • Write Spark applications using the SparkR API (library).
  • Write Spark applications using the sparklyr API (library).

Prerequisites

Prior knowledge of R and SQL is presumed.

Outline

Databricks Introduction
  • Getting Things Ready
  • Tour the Databricks Workspace
  • Create a Spark Cluster
  • Create Spark Tables
Using Databricks Notebooks
  • Using Spark SQL in a Databricks Notebook
  • Touring the Databricks Notebook
  • Managing Cells
  • Managing Notebooks
  • Finding Sample Notebooks
Visuals and Dashboards
  • Creating and Customizing Visuals
  • Creating Dashboards
Exploring Spark SQL
  • Creating Tables Over Flat Files (Schema On Read)
  • Common SQL Operations
  • JOINS
  • UNION
  • Scalar Functions
  • Aggregations
  • Creating Views and Tables
  • Common Table Expressions (CTE)
  • Reading and Writing Data
  • Saving to Parquet Files
  • Saving to Delta Tables
  • Using SQL from R
Intro to R on Spark
  • Running R Locally on Spark
  • Importing R Libraries
Using SparkR
  • Intro to SparkR
  • Differences Between R and SparkR
  • Understanding Apache Arrow
Performing Exploratory Data Analysis (EDA)
  • Reading and Writing Data
  • Writing Custom User Defined Functions
Intro to sparklyr
  • Differences Between SparkR and sparklyr
  • Using sparklyr
Performing EDA with sparklyr
  • Reading and Writing Data
Conclusion

Training Materials

All Azure Databricks training attendees receive a copy of the instructor’s handout and all code created during the class.

Software Requirements

Attendees will write applications using the Databricks service running in the cloud.



Learn faster

Our live, instructor-led lectures are far more effective than pre-recorded classes.

Satisfaction guarantee

If your team is not 100% satisfied with your training, we do what's necessary to make it right.

Learn online from anywhere

Whether you are at home or in the office, we make learning interactive and engaging.

Multiple payment options

We accept check, ACH/EFT, major credit cards, and most purchase orders.


