P69 Latam Partner Bootcamp - Data Platform

You might also like

Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 71

Cortana Intelligence Suite

Overview

Nathan Colossi
Cloud Solution Architect
Data and AI
Transform data into intelligent action
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data People
Sources
Machine Cognitive
Data Factory Data Lake Store
Learning Services

SQL Data Data Lake Bot Web


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs (Hadoop and Cortana Mobile
Cosmos DB
Spark) Apps

Stream Analytics Bots


Dashboards &
IOT Hubs
Visualizations
Sensors Azure
Azure Analysis
Analysis
and Services
Services
devices Power BI
Azure Databricks
Automated
Systems

Data Intelligence Action


Use Case: Data Warehouse and Modern BI
 We seek to deploy a cloud-backed OLTP to OLAP
transformation pipeline that accommodates:
 Hosting multiple data marts with each holding subsets of the OLAP dataset.
 Batch load pipelines to support load and trickle loading of both the central
data lake and the data marts.
 Monitoring & alarming to track Fact/Dimension updating failures both in the
data lake & it's subsequent propagation into the data marts.
Data Warehouse and Modern BI: Architecture
Data Warehouse and Modern BI: Components
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data People
Sources
Data Lake Store Machine Cognitive
Data Factory & Azure Storage Learning Services

SQL Data Data Lake Bot Web


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs (Hadoop and Cortana Mobile
Cosmos DB
Spark) Apps

Stream Analytics Bots


Dashboards &
IOT Hubs
Visualizations
Sensors Azure
Azure Analysis
Analysis
and Services
Services
devices Power BI
Azure Databricks
Automated
Systems

Data Intelligence Action


Azure SQL Database
AZURE SQL DATABASE
THE BEST AND MOST ECONOMICAL
CLOUD DESTINATION FOR YOUR SQL
SERVER APPS
Built-in intelligence Breakthrough productivity Seamless and compatible Competitive TCO
and performance

Realize up to a 406% ROI over on-premises and hosted solutions


AZURE SQL DATABASE MOMENTUM

4.1MILLION 154PB 421K


databases total size subscriptions

3TRILLION 1.4 MILLION 100MILLION


queries analyzed/hour
batch requests/day HTAP transactions/sec

“We had an incident lasting for about ““Intelligent Insights proactively “SQL Threat Detection helps us to
6 months. Before Intelligent insights finds a database performance be ahead of the threats instead
we have not had a way of figuring out problem in a more efficient way of chasing them.”
where do we even start and much faster than humans.
troubleshooting. Intelligent Insights With it we can proactively help
gave us a list of things to do. What customers until we have a fix Shahin Kohan, CTO
Intelligent Insight does is that it for the problem.”, Bauke Stil,
enables us to pinpoint where the “SQL Threat Detection helps us
Application Manager, SnelStart
problem is and to get a fix deployed respond to activities, which were
within 24hrs.” not visible beforehand.”

Frans Lytzen – CTO, New Orbit Bauke Stil, App Manager, SnelStart Manrique Logan, architect & technical
lead
INTRODUCING AZURE SQL DATABASE MANAGED INSTANCE

Azure SQL Database

Managed
SQL SQL Single Elastic Pool
Instance
Instance-scoped programming Standalone managed Shared resource model for
model with high compatibility database for predictable and greater efficiency through
to SQL Server stable workloads multi-tenancy

Best for modernization at


scale with low cost and effort
SEAMLESS CLOUD
I N T E G R AT I O N
Easy lift-and-shift migration
Azure SQL Database
Managed Instance
Azure SQL Database Managed
Instance p r i v at e p r e v i e w
facilitates lift and shift migration from on-premises
SQL Server to cloud Azure Hybrid Benefit
(AHB) for SQL Server
Azure Hybrid Benefit for SQL
Server
maximizes current on-premises license investments
to facilitate migration
Database Migration
Database Migration Service Service (DMS)
(DMS) p r i v a t e p r e v i e w
provides seamless and reliable migration at scale
with minimal downtime

Most consistent data platform


Classified as Microsoft Confidential
Azure Data Factory
Azure Data Factory
• Orchestrate, monitor & schedule - Compose data
processing, storage & movement services (on premises &
cloud)

• Automatic infrastructure management


• Combine pipeline intent w/ resource allocation & mgmt
• Globally available data movement capability

• Single pane of glass - One place to manage and


monitor your data pipelines
Data Factory Data Set Activity: a processing step
Pipeline: a sequence of
activities (logical group)
Concepts
(Collection of files, (Hadoop job, custom code, ML model, etc)
DB table, etc)


Data Sources Ingest Transform & Analyze Publish

Game Logs Game Log Files


Transform,
Combine, etc Analyze Move Visualize

Customer Table

Customer Table

Customer Customer
On Premises Table
Geocode
Game Usage
Data Mart
Azure Blob Storage Azure DB
Basic Concepts
• ADF is Microsoft’s unified platform for ETL/ELT services in the cloud
• ADF allows you to build data pipelines and execute them/schedule their runs
• Data pipeline is a chain/group of activities to be performed on your data, e.g. data
movements/transformations
• Some activities are powered by services with data store/compute resources allocated outside ADF, e.g.
HDI/ML/etc.

* Integration Runtime available


only on v2
Command and Control
Data

UX & SDK
Data Factory
Authoring | Monitoring/Mgmt A data integration account.
Location of orchestration, service metadata
Azure Data Factory v2 Service
Scheduling | Orchestration | Monitoring

Pipeline SSIS
Package
Integration Runtime (IR)
ADF’s execution engine

Three core capabilities:


• data movement
• pipeline activity execution
• SSIS package execution

Self Hosted
Integration Azure
Integration
To integrate data flow and control flow across the
Runtime
Runtime enterprises’ hybrid cloud, customer can instantiate
multiple IR instances for different network
environments:
On Prem Apps & Cloud Svcs, Apps - On premises (similar to DMG in ADF V1)
Data & Data - In public cloud
- Inside VNet
Azure SQL Data Warehouse
A Z U R E S Q L D ATA WA R E H O U S E

PB scale with MPP and independent compute/ storage scale

G r o w, s h r i n k , a n d p a us e i n s e c o n d s

PREVIEW
Compute-optimized for demanding workloads

PREVIEW
U n l i m i t e d c ol u m n a r s t o r a g e
Fully managed PaaS
Data Warehouse Units DWU

Normalized amount of compute 100


200
Converts to billing units i.e. what you pay 300
400
500
600
CPU RAM I/O 1000
1200
1500
2000
3000
6000

30000
Target workload: Analytics
Store large volumes of data
Consolidate disparate data into a single location
Shape, model, transform and aggregate data
Perform query analysis across large datasets
Ad-hoc reporting across large data volumes
All using simple SQL constructs
“SQL on SQL”
Unsuitable workloads
Operational workloads
High frequency reads & writes
Large numbers of singleton selects
High volume of single row inserts
Data Preparation
Row by row processing needs
Incompatible formats (JSON, XML)
Scaling and Pausing

Control
Compute
Storage
Distributing Data
CREATE TABLE [build].[FactOnlineSales]
(
[OnlineSalesKey] int NOT NULL
, [DateKey] datetime NOT NULL
, [StoreKey] int NOT NULL
, [ProductKey] int NOT NULL
, [PromotionKey] int NOT NULL
, [CurrencyKey] int NOT NULL
, [CustomerKey] int NOT NULL
, [SalesOrderNumber] nvarchar(20) NOT NULL
, [SalesOrderLineNumber] int NULL
, [SalesQuantity] int NOT NULL
, [SalesAmount] money NOT NULL
)
WITH
(
DISTRIBUTION = HASH([ProductKey])
)
;
Query Execution Query
Result

Control
Compute
Storage
Azure Analysis Services
Azure Analysis Services
Enterprise grade analytics engine as a service

Build rich Gain insights at the Proven Provision and


semantic models speed of thought technology scale with ease
Transform complex Gain instant insights Based on powerful, Easy to deploy, scale,
data into business with in-memory cache proven SQL Server and manage as a
user friendly semantic using your preferred Analysis Services platform-as-a-service
models visualization tools solution
Azure Analysis Services
Azure Analysis Services is based on proven SQL Server technology
Data sources BI semantic model Client tools

Cloud Cloud

SQL Database Security

SQL Data Warehouse Data modeling Power BI

Other data sources In-memory Power BI Embedded


cache

On-premises On-premises
Azure Analysis Services
SQL Server Power BI Desktop

Analytics platform
system Excel
Lifecycle management Business logic & metrics
Other data sources
Third party BI tools
SQL Server Analysis Services

Visualizations & insights

Power BI

Other 3rd party tools & services

On-prem data sources Authoring & Dev

SQL Server Visual Studio SQL Server


Reporting Services
SQL Server
Other SSMS
data sources Analysis Services
Azure Analysis Services

Azure
Analysis Services
Cloud data sources
Visualizations & insights
SQL Database
Power BI

SQL Other 3rd party tools & services


Data Warehouse

On-prem data sources Authoring & Dev

SQL Server Visual Studio

Other SSMS
data sources
Transform data into intelligent action
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data People
Sources
Machine Cognitive
Data Factory Data Lake Store
Learning Services

SQL Data Data Lake Bot Web


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs (Hadoop and Cortana Mobile
Cosmos DB
Spark) Apps

Stream Analytics Bots


Dashboards &
IOT Hubs
Visualizations
Sensors Azure
Azure Analysis
Analysis
and Services
Services
devices Power BI
Azure Databricks
Automated
Systems

Data Intelligence Action


Information Management & Ingestion
Information
Management
Data
Sources
Data Factory

Data Catalog
Apps

Event Hubs

IOT Hubs
Sensors
and
devices

Data
Get more value from your enterprise data assets

Information
Management

Data Factory

Data Catalog

Event Hubs

IOT Hubs

• Spend less time looking for data, and more time getting value from it • Intuitive search and filtering to understand the data sources and their
• Register enterprise data sources, discover data assets and unlock their purpose
potential, and capture tribal knowledge to make data understandable • Let your data live where you want; connect using tools you choose
• Bridge the gap between IT and the business, allowing everyone to • Integrate into existing tools and processes with open REST APIs
contribute their insights, tags, and descriptions
Ingest events from websites, apps and devices at cloud scale

Information Data
Management sources

SQL Database Machine Learning

Data Factory

Apps Azure
API
Data Catalog Management Storage HDInsight

Sensors Event Hubs


Event Hubs and
devices
Backend Services
Stream Analytics Power BI
IOT Hubs Data

• Log millions of events per second in near real time • Get a managed service with elastic scale
• Connect devices using flexible authorization and throttling • Reach a broad set of platforms using native client libraries
• Use time-based event buffering • Pluggable adapters for other cloud services
• Get a managed service with elastic scale
Specialized Device Scenarios with IOT Suite
IoT Hub 10
1
0 0

IP-capable
1 1
1 0 1 100110
0 1 0 1010
1 0
0 1010
0111
Information
Management Event processing and insight
(e.g. hot and cold paths)

Data Factory
Cloud
PAN-devices

protocol
gateway
Device business logic,
Connectivity monitoring
Data Catalog

Field
gateway

Event Hubs
Application
device provisioning and management

IOT Hubs Data field cloud


• Support for multiple protocols (AMQP, MQTT)
• Bi-directional communication
• Device specific secuirty
• Multi-platform gateway SDK.
Big Data Stores
Information Big Data Stores
Management
Data
Sources
Data Factory Data Lake Store

SQL Data
Data Catalog Warehouse
Apps

Event Hubs Cosmos DB

IOT Hubs

Sensors
and
devices

Data
A hyper-scale repository for big data analytics workloads
Big Data Stores

ADL Analytics
Data Lake Store
Devices Social
HDInsight
SQL Data
Warehouse LOB ADL Store
Applications Video
R
Cosmos DB

Web Sensors
Spark

Relational Clickstream Machine Learning

• A Hadoop Distributed File System for the cloud • Massive throughput to increase analytic performance
• No fixed limits on file size • High durability, availability, and reliability
• No fixed limits on account size • Azure Active Directory access control
• Unstructured and structured data in their native format
Data lake is the center of a big data solution
A storage repository, usually Hadoop, that holds a vast amount of raw data in its native
format until it is needed.

• Inexpensively store unlimited data


• Collect all data “just in case”
• Store data with no modeling – “Schema on read”
• Complements EDW
• Frees up expensive EDW resources
• Quick user access to data
• ETL Hadoop tools
• Easily scalable
• With Hadoop, high availability built in
Data Lake layers
• Raw data layer– Raw events are stored for historical reference. Also called staging layer or
landing area
• Cleansed data layer – Raw events are transformed (cleaned and mastered) into directly
consumable data sets. Aim is to uniform the way files are stored in terms of encoding,
format, data types and content (i.e. strings). Also called conformed layer
• Application data layer – Business logic is applied to the cleansed data to produce data ready
to be consumed by applications (i.e. DW application, advanced analysis process, etc). This is
also called by a lot of other names: workspace, trusted, gold, secure, production ready,
governed
• Sandbox data layer – Optional layer to be used to “play” in.  Also called exploration layer or
data science workspace
Still need data governance so your data lake does not turn into a data swamp!
ADL Store vs Blob Store
Azure Azure
Data Lake Store Blob Storage
Purpose Optimized for big data analytics General purpose bulk storage

Use Cases Batch, Interactive, Streaming App backend, backup data, media storage
for streaming

Units of Storage Accounts / Folders / Files Accounts / Containers / Blobs

Structure Hierarchical File System Flat namespace

WebHDFS Implements WebHDFS No (WASB)

Security AD SAS keys

Storage Auto Shared/Files chunked Manually manage expansion/Files intact

Size Limits No limits on account size, file size, # files 500TB account, 4.75TB file

Service State Generally Available Generally Available

Billing Pay for data stored and for I/O Pay for data stored and for I/O

Region Availability Two US regions (East, Central) & North All Azure Regions
Europe (Other regions coming soon)
Azure Cosmos DB
A globally distributed, massively scalable, multi-model database service

Big Data Stores

Data Lake Store


MongoDB Table API

SQL Data
Warehouse

Column-family
Document
Cosmos DB
Key-value Graph

Guaranteed low latency at the 99 th percentile


Elastic scale out
of storage & throughput Five well-defined consistency models

Turnkey global distribution Comprehensive SLAs


Guaranteed low latency at P99 (99th percentile)
Requests are served from local region
Reads Indexed writes
(1KB) (1KB) Single-digit millisecond latency worldwide

Write optimized, latch-free database engine


P50 <2ms <6ms designed for SSD

Synchronous automatic indexing at sustained


P99 <10ms <15ms ingestion rates
Elastically scalable storage and throughput
Single machine is never a bottle neck
Provisioned request / sec

Black Friday
12000000
10000000
Transparent server-side partition management
8000000
6000000
4000000
Elastically scale storage (GB to PB) and throughput (100 to 100M req/sec)
across many machines and multiple regions
2000000

Nov 2016 Dec 2016

Time Automatic expiration via policy based TTL


Hourly throughput (request/sec)
Pay by the hour, change throughput at any time for only what you need
A document-oriented, NoSQL database service
Big Data Stores DocumentDB account Databases Collections Documents Attachments
101
{ } 010
{ }
Data Lake Store
Users Your documents here
SQL Data
Warehouse
JS Stored procedures

Cosmos DB

Permissions
JS Triggers

JS User-defined functions

• Designed for modern mobile and web applications • Enables complex ad hoc queries using a dialect of SQL
• Delivers consistently fast reads and writes, schema flexibility, and • Supports multi-document transaction processing using the
the ability to easily scale a database up and down on demand familiar programming model of stored procedures,
• Offers native support for JavaScript, SQL query, and transactions triggers, and UDFs
over JSON documents
Machine Learning and Analytics
Information Big Data Stores Machine Learning
Management and Analytics
Data
Sources
Machine
Data Factory Data Lake Store
Learning

SQL Data Data Lake


Data Catalog Warehouse Analytics
Apps
HDInsight
Event Hubs Cosmos DB (Hadoop and
Spark)

IOT Hubs Stream Analytics

Sensors Azure Analysis


and Services
devices
Azure Databricks

Data Intelligence
Big data analytics made easy
Machine Learning
and Analytics

Machine
Learning
Data Lake Analytics
Data Lake
Analytics

HDInsight
(Hadoop and
Spark)

Stream Analytics

Azure Analysis
Services SQL Data SQL Database Data Lake Store Storage Blobs SQL Database
Warehouse in a VM

• Analyze data of any kind and size • Managed and supported with an enterprise-grade SLA
• Develop faster, debug and optimize smarter • Dynamically scales to match your business priorities
• Interactively explore patterns in your data • Enterprise-grade security with Azure Active Directory
• No learning curve—use U-SQL (SQL with C#) • Built on YARN, designed for the cloud
Develop massively parallel programs with simplicity

@searchlog =
EXTRACT UserId int,
Start DateTime,
Region string,
Query string,
Duration int,
Urls string,
ClickedUrls string
FROM @"/Samples/Data/SearchLog.tsv"
USING Extractors.Tsv();

OUTPUT @searchlog
TO @"/Samples/Output/SearchLog_output.tsv"
USING Outputters.Tsv();

A simple U-SQL script can scale U-SQL automatically generates a scaled Execution nodes immediately
from Gigabytes to Petabytes without out and optimized execution plan to rapidly allocated to run the
learning complex big data handle any amount of data. program.
programming techniques.
Error handling, network issues, and
runtime optimization are handled
automatically.
1
DECLARE @endDate DateTime = DateTime.Now;
U-SQL
DECLARE @startDate DateTime = @endDate.AddDays(-7); 
Basics
@orders =  2
     EXTRACT 
         OrderId  int,  (1) DECLARE constant values using C#
         Customer string,  Expressions
         Date     DateTime, 
         Amount   float
(2) EXTRACT performs schema on
     FROM "/input/orders.txt"
     USING Extractors.Tsv(); read for files and places results in a
RowSet

@orders = SELECT * FROM @orders (3) RowSet refinement and dataflow


3
     WHERE Date >= startDate AND Date <= endDate; programming

(4) OUTPUT for writing files. Built-in


@orders = SELECT * FROM @orders
handling for CSV & TSV
     WHERE Customer.Contains(“Contoso”);

Notes:
OUTPUT @orders • Whole-script optimization
    TO "/output/output.txt"
    USING Outputters.Tsv();
4
Fully-managed Hadoop and Spark
Azure for the cloud
HDInsight 100% Open Source Hortonworks
data platform
Hadoop and Spark
Clusters up and running in minutes
as a Service on Azure
Managed, monitored and supported
by Microsoft with the industry’s best SLA
Familiar BI tools for analysis, or open source
notebooks for interactive data science
63% lower TCO than deploy your own
Hadoop on-premises*

*IDC study “The Business Value and TCO Advantage of Apache Hadoop in the Cloud with Microsoft Azure HDInsight”
Microsoft + Hortonworks
Promoting Open Hadoop

Engineering alignment
Corporate alignment
Field alignment
Hortonworks Data Platform (HDP) 2.6
(under the covers of HDInsight)

Simply put, Hortonworks ties all the open source products together (22)
Spark for Azure HDInsight
In Memory Processing on Multiple Workloads
• Single execution model for multiple
Spark
SQL
Spark
Streaming
Machine
Learning
Graph tasks

• Processing up to 100x faster


performance than Map Reduce

Core Engine • Developer friendly (Java, Python, Scala)

• BI tool of choice (Power BI, Tabelau,


Batch Script SQL NoSQL Streaming In Memory Qlik, SAP)
Azure Map Pig Hive Hbase Storm Spark
HDInsight reduce
• Notebook experience (Jupyter/iPython,
Core Engine
Zeppelin)
HDInsight Supports Hive
SQL-like queries on Hadoop data in HDInsight
HDInsight provides easy-to-use graphical query interface for Hive
HiveQL is a SQL-like language (subset of SQL)
Hive structures include well-understood database concepts such as tables, rows, columns, partitions
Compiled into MapReduce jobs that are executed on Hadoop

Dramatic performance gains with Stinger/Tez


Stinger is a Microsoft, Hortonworks and OSS driven initiative to bring interactive queries with Hive
Brings query execution engine technology from Microsoft SQL Server to Hive
Performance gains up to 100x
Microsoft contribution to Sample Query
Apache code

32x Speedup
1400s 40X
Speedup 100x
44.3s Speedup
35.1s 15s
HDP 2.1
Hive 10 HDP 1.3 / HDP 2.0
HDInsight Supports HBase
NoSQL database on data in HDInsight
Columnar, NoSQL database
Runs on top of the Hadoop Distributed File System (HDFS)
Provides flexibility in that new columns can be added to column families at any time

HMaster
Coordination

Name Node Region Server Region Server Region Server Region Server

Job Tracker

Data Node Data Node Data Node Data Node

Task Tracker Task Tracker Task Tracker Task Tracker


HDInsight Supports Storm
Stream analytics for Near-Real Time Processing
Consumes millions of real-time events from a scalable event broker (ie. Apache Kafka, Azure Event Hub)
Performs time-sensitive computation
Output to persistent stores, dashboards or devices
Customizable with Java + .NET
Deeply integrated to Visual Studio

Event Collection Event Queuing Transformation Long-term Presentation


producers System storage and action

Apache
Storm on HBase Web/thick client
Kafka / HDInsight
Applications RabbitMQ / dashboards
ActiveMQ HDFS

Azure DBs Search and query


Stream
Cloud gateways
Devices Event Hubs processin
Azure Stream
(web APIs) Azure
g
Analytics storage
Data analytics (Excel)
Sensor Live Dashboards
s

Field Storage
Web and Social gateways adapters Devices to take action
R Server for HDInsight

• Familiarity of R (most popular language for data


scientists)
• Scalability of Hadoop and Spark
• Up to 7x faster using Spark engine
• Train and run ML models on datasets of any size
• Cloud managed solution (easy setup, elastic,
SLA)

Only managed, cloud solution for doing R


Real-time stream processing in the cloud
Machine Learning
and Analytics
SQL Database
Machine
Learning 10
Event Hubs 10

Data Lake
Analytics Event Hubs Blob Storage
HDInsight
(Hadoop and Stream
Spark)
10 Analytics
Stream Analytics 10 Power BI Table Storage

Azure Analysis
Services Blob Storage

• Perform real-time analytics for your Internet of Things solutions • Create real-time dashboards and alerts over data from
• Stream millions of events per second devices and applications
• Correlate across multiple streams of data
• Get mission-critical reliability and performance with predictable
results • Use familiar SQL-based language for rapid development
Tumbling Window

SELECT System.TimeStamp AS OutTime, TollId,


COUNT (*)
FROM Input TIMESTAMP BY EntryTime
GROUP BY TollId, TumblingWindow(minute,5)
Streaming Options with Microsoft

Greatest
SIMPLICITY. Maximum
Lowest TCO
Job Service Cluster Service CONTROL

Fully Managed Integrations with


managed -
SQL like Machine Integrations for Custom Spark
SLA with 3
9s of
Kafka, ipython Full stack
query Learning real-time code
3 9s of
language integrations dashboarding extensions
Streaming & availability
notebook R-
Server
control
availability Storm

Azure Stream Analytics Spark Streaming & Storm (HD Insight) Virtual Machines
AZURE DATABRICKS

Azure Databricks
Collaborative Workspace

Machine learning models


IoT / streaming data
DATA ENGINEER DATA SCIENTIST BUSINESS ANALYST

Deploy Production Jobs & Workflows


BI tools
Cloud storage

MULTI-STAGE PIPELINES JOB SCHEDULER NOTIFICATION & LOGS

Data warehouses
Optimized Databricks Runtime Engine Data exports

Hadoop storage
DATABRICKS I/O APACHE SPARK SERVERLESS Rest APIs
Data warehouses

Enhance Productivity Build on secure & trusted cloud Scale without limits
KNOWING THE VARIOUS BIG DATA SOLUTIONS

CONTROL EASE OF USE

Azure Data Lake


Reduced Administration
Analytics
Azure Databricks

Azure HDInsight

ANALYTICS
BIG DATA
Azure Marketplace
HDP | CDH | MapR

Any Hadoop technology, Workload optimized, Frictionless & Optimized Data Engineering in a
any distribution managed clusters Spark clusters Job-as-a-service model

IaaS Clusters Managed Clusters Big Data as-a-service


Azure Data Lake
Analytics
Azure Data Lake Store

BIG DATA
STORAGE
Azure Storage
Intelligence
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data
Sources
Machine Cognitive
Data Factory Data Lake Store
Learning Services

SQL Data Data Lake Bot


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs Cosmos DB (Hadoop and Cortana
Spark)

IOT Hubs Stream Analytics

Sensors Azure Analysis


and Services
devices
Azure Databricks

Data Intelligence
Dashboards & Visualizations
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data
Sources
Machine Cognitive
Data Factory Data Lake Store
Learning Services

SQL Data Data Lake Bot


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs Cosmos DB (Hadoop and Cortana
Spark)

Stream Analytics
IOT Hubs Dashboards &
Visualizations
Sensors Azure
Azure Analysis
Analysis
and Services
Services
devices Power BI
Azure Databricks

Data Intelligence
Power BI: experience your data
Any data, any way, anywhere

Web

Mobile
Out-of-the-box SaaS content packs

Real time dashboards & interactive reports Excel


Cloud data
Natural Language query
Embedded
Custom visualizations

Native Office 365 integration Cortana

Power BI
On-premises
“……”
data
Power BI product portfolio

Author Share and collaborate Large scale deployments Share and collaborate App dev

</>

Power BI Power BI Power BI Power BI


Power BI Embedded
Desktop service Premium Report Server

Free data analysis Cloud-based modern Dedicated capacity for On-premises report server Visual analytics embedded
and report authoring tool business analytics solution increased performance in your applications
Power BI Desktop
Free companion authoring tool for the Power BI service

Get Data
Easily connect, clean, and mashup data

Analyze
Build powerful models and flexible measures

Visualize
Create stunning interactive reports

Publish
Share insights with others

Download free from PowerBI.com or the Windows Store


Collaborate
Empower your organization with self-service analytics
Power BI service
Cloud-based SaaS solutions

Get started quickly

Secure, live connection to your data sources,


on-premises and in the cloud

Auto insights and intuitive data exploration using


natural language query

Pre-built dashboards and reports for popular SaaS


solutions

Live, real-time dashboard updates

Deliver insights through other services such as


SharePoint, PowerApps and Teams
Power BI Premium
Dedicated resources in the cloud
Power BI service – Contoso organization

Shared capacity
Greater scale and performance Premium capacity – P1

Premium capacity – P2
My workspace
Premium capacity – P3 User 2
Flexibility to license by capacity
App workspace My workspace
Sales User 3
My workspace
App workspace
User 1
Extending on-premises capabilities Marketing
APIs
Custom app

Power BI Premium
Power BI Report Server

On-premise reporting solution


Power BI reports and SSRS report on-premises

Connect to data
Over 70+. Data can be imported, queried directly or live connection to SSAS

Power BI reports
Fully interactive reports on-premises to visualize your data and gain insights

SSRS reports
Precisely formatted operational reports

Consume in multiple ways


Mobile Apps, Web Portal, Embedded in your organizational apps
Power BI Embedded
Embed stunning, fully interactive reports and visuals into your applications

Spend time focusing on your product instead of


building visual analytic features from scratch

Connect to countless data sources so that you can


expose insights to your customers

Take advantage of our existing Custom Visual library


or dream up the right visual for your customers

Embed consistent, easy-to-navigate visualization


experiences across any device

Leverage other familiar services like Visual Studio,


Azure Web Aps and other Azure services
Transform data into intelligent action
Information Big Data Stores Machine Learning Intelligence
Management and Analytics
Data People
Sources
Machine Cognitive
Data Factory Data Lake Store
Learning Services

SQL Data Data Lake Bot Web


Data Catalog Warehouse Analytics Service
Apps
HDInsight
Event Hubs (Hadoop and Cortana Mobile
Cosmos DB
Spark) Apps

Stream Analytics Bots


IOT Hubs Dashboards &
Visualizations
Sensors Azure
Azure Analysis
Analysis
and Services
Services
devices Power BI
Power BI
Azure Databricks
Automated
Systems

Data Intelligence Action


© 2016 Microsoft Corporation. All rights reserved.

You might also like