Twoday Kapacity Microsoft Fabric Event 281123 Faelles Spor

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 63

Live-event:

Seneste nyt om
Microsoft Fabric
Lyngby, den 28. november
Just Blindbæk
https://www.linkedin.com/in/blindbaek/

just.blindbaek@twoday.com
Program
– 09.10: Microsoft Fabric er ude – hvad betyder det for dig?

– 10.00: Pause

– 10.15: Indblik i de nye unikke features i Microsoft Fabric

– 10.55: Sådan udnytter du det fulde potentiale i Microsoft Fabric


fra start

– 11.15: Spørgsmål & opsamling

– 11.30: Frokost

– 12.15: Spor

– 13.30: Kaffe og kage


3
Dagens to spor senere

Power BI
Self-service End-to-end
demo Lakehouse demo
Lokale 1.06 (1.sal) Her i lokalet

4
Title Slides

Analytics in the Era of AI


The 2023 ML, AI, and Data Landscape
Analytics is complex and fragmented
Every project has many
subsystems

Every subsystem need a


different class of product

Products often comes from


multiple vendors

Integration is complex, fragile


and expensive
“Simplify,
I am the Chief Data
Officer and don’t want
to be the Chief
Integration Officer.”
– Every CDO, Every Enterprise
A silver lining?
Analytics has very
predictable patterns
Data Data Data Real-Time Data Business
Integration Engineering Warehousing Analytics Science Intelligence

Microsoft has all the


products with the right scale
needed to build a complete
analytics system
Data Lake

Governance and Administration


Product Leadership
Data Integration Analytics Artificial Intelli
ABILITY TO EXECUTE

ABILITY TO EXECUTE

ABILITY TO EXECUTE
COMPLETENESS OF VISION As of August 2022 COMPLETENESS OF VISION As of September 2022 COMPLETENESS OF VISION As o

Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner research publications consist of the
opinions of Gartner’s research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular
purpose. This graphic was published by Gartner, Inc. as part of a larger research document and should be evaluated in the context of the entire document. The Gartner document is available upon request from Microsoft.
Product Leadership
alytics Artificial Intelligence Business Intelligence

ABILITY TO EXECUTE

ABILITY TO EXECUTE
As of September 2022 COMPLETENESS OF VISION As of January 2022 COMPLETENESS OF VISION As of January 2023

Gartner does not endorse any vendor, product or service depicted in its research publications, and does not advise technology users to select only those vendors with the highest ratings or other designation. Gartner research publications consist of the
opinions of Gartner’s research organization and should not be construed as statements of fact. Gartner disclaims all warranties, expressed or implied, with respect to this research, including any warranties of merchantability or fitness for a particular
purpose. This graphic was published by Gartner, Inc. as part of a larger research document and should be evaluated in the context of the entire document. The Gartner document is available upon request from Microsoft.
Still far too complex
Many Products

Different Experiences
Purview
Proprietary and Open Azure AI
Dedicated and Serverless Power BI
PaaS and SaaS Synapse DW

Different Business Models Kusto


Steep Learning Curves Synapse Spark

Deep Expertise Needed Data Factory

High Integration Effort


General availability

Microsoft Fabric
The data platform for the era of AI

Complete Lake-centric Empower every


AI-powered
analytics platform and open business user
Biggest release of
Microsoft a data product
since the launch of
SQL Server

Satya Nadella
May 2023
Data analytics for the era of AI
Microsoft Fabric
The unified data platform for the era of AI

Data Synapse Data Synapse Data Synapse Data Synapse Real


Power BI Data Activator
Factory Engineering Science Warehousing Time Analytics

OneLake
Microsoft Fabric
The data platform for the era of AI

Complete Lake centric Empower AI


Analytics and open Every Business Powered
Platform User

Everything, unified OneLake Familiar and intuitive Copilot accelerated

SaaS-ified One Copy Built into Microsoft 365 ChatGPT on your data

Secured and governed Open at every tier Insight to action AI driven insights
Microsoft Fabric
The data platform for the era of AI

Complete Lake centric Empower AI


Analytics and open Every Business Powered
Platform User

Everything, unified OneLake Familiar and intuitive Copilot accelerated

SaaS-ified One Copy Built into Microsoft 365 ChatGPT on your data

Secured and governed Open at every tier Insight to action AI driven insights
Microsoft Fabric

Single…
Onboarding and trials
Data Synapse Data Synapse Data Synapse Data Synapse Real Power BI Data Sign-on
Factory Engineering Science Warehousing Time Analytics Activator
Navigation model
UX model
AI Assisted Workspace organization
Collaboration experience
Shared Workspaces Data Lake
Storage format
Universal Compute Capacities Data copy for all engines
Security model
OneSecurity CI/CD
Monitoring hub
OneLake Data hub
Governance &
compliance
Intelligent data foundation
SaaS
“It just works”

Success Centralized
5x5
by default administration

Frictionless onboarding Minimal knobs Tenant-wide governance

Centralized
Instant provisioning Auto-optimized
security management

Quick results w/ Intuitive UX Auto-integrated Compliance built-in


Microsoft Fabric
The data platform for the era of AI

Complete Lake centric Empower AI


Analytics and open Every Business Powered
Platform User

Everything, unified OneLake Familiar and intuitive Copilot accelerated

SaaS-ified One Copy Built into Microsoft 365 ChatGPT on your data

Secured and governed Open at every tier Insight to action AI driven insights
OneLake for all Data
“The OneDrive for Data”
A single SaaS lake for the
whole organization

Data Synapse Data Synapse Data Synapse Data Synapse Real Power BI Data Provisioned automatically with
Factory Engineering Science Warehousing Time Analytics Activator the tenant

All workloads automatically


store their data in the OneLake
workspace folders

All the data is organized in an


intuitive hierarchical
namespace

OneLake
The data in OneLake is
automatically indexed for
discovery, MIP labels, lineage,
Intelligent data foundation PII scans, sharing, governance
and compliance
One Copy for all computes
Real separation of compute and storage
All the compute engines store
their data automatically in
OneLake

Data Synapse Data Synapse Data Synapse Data Synapse Real Power BI Data The data is stored in a single
Factory Engineering Science Warehousing Time Analytics Activator common format

Delta – Parquet, an open


standards format,
is the storage format for all
tabular data in Analytics vNext
Spark T-SQL Serverless KQL
Analysis
Services

Compute Once data is stored in the lake, it


is directly accessible by all the
engines without needing
any import/export

All the compute engines have


Customers Service Business been fully optimized to work with
Finance
360 Telemetry KPIs Delta Parquet as their native
format
Delta – Parquet Delta – Parquet Delta – Parquet Delta – Parquet
Format Format OneLake Format Format
Shared universal security model is
enforced across all the engines
Semantic Models
Database files

“Direct Query Mode”


SQL DAX
Data Queries Power BI Queries
Tables Scan Warehouse/ Analysis
Slow, but real time
Reports
Lakehouse Services

Storage

Database files

“Import Mode”
DAX
Data Queries
Power BI
Tables Scan Warehouse/ Import
Analysis
Latent & duplicative but fast
Reports
Lakehouse Services

Storage
Copy of
Tables
Semantic Models
Database files

“Direct Query Mode”


SQL DAX
Data Queries Power BI Queries
Tables Scan Warehouse/ Analysis
Slow, but real time
Reports
Lakehouse Services

Storage

Database files

“Import Mode”
DAX
Data Queries
Power BI
Tables Scan Warehouse/ Import
Analysis
Latent & duplicative but fast
Reports
Lakehouse Services

Storage
Copy of
Tables

Parquet/Delta Lake

“Direct Lake Mode”


DAX
Data Queries
Power BI
Tables Warehouse/ Scan
Analysis
Perfect!
Reports
Lakehouse Services

OneLake
Taking One Copy to the Next Level
Shortcuts
Sharing data in OneLake is as
easy as sharing files in
OneDrive, removing the needs
for data duplication
Data Synapse Data Synapse Data Synapse Data Synapse Real Power BI Data
Factory Engineering Science Warehousing Time Analytics Activator With shortcuts, data
throughout OneLake can be
composed together without
any data movement

Shortcuts also allow instant


linking of data already existing
Customers Service Business in Azure and in other clouds,
Finance Telemetry KPIs
OneLake
360 without any data duplication
and movement, making
OneLake the first multi-cloud
data lake

With support for industry


standard APIs, OneLake data
can be directly accessed by
any application or service
Azure Amazon Google (coming soon)
Microsoft Fabric + Azure Databricks
Better together

Azure
Databricks

Data Synapse Data Synapse Data Synapse Data Synapse Real


Power BI Data Activator
Factory Engineering Science Warehousing Time Analytics

DFS API

OneLake Shortcut
Mirroring in Microsoft Fabric
Frictionless Linking of External Databases

TDS XML/A Delta

Fabric Power Fabric


Snowflake DW BI Spark

CDC
(Proprietary DB Delta Parquet in OneLake
Replicator
format)
Consumption replica

Mirroring Preview for SQL, Cosmos and Snowflake


Introducing Mirroring in Microsoft Fabric
OneLake for OneLake
all domains
Sales Customer

A true hub & spoke HR IT Marketing


data mesh across
organizational data
domains. Finance Operations

Workspaces and artifacts for Without data movement, Data is secured and governed in Data can be certified by domain
different data domains, contribute data from different domains one place while remaining easily experts to enabling trust for
to building the same data lake can be analyzed, blended discoverable and accessible to data which is discovered
and transformed together all who should have access
across the organization
DEMO: Getting started
with Fabric
Microsoft Fabric
The data platform for the era of AI

Complete Lake centric Empower AI


Analytics and open Every Business
Office Powered
Platform Use
User

Everything, unified OneLake Familiar and intuitive Copilot accelerated

SaaS-ified One Copy Built into Microsoft 365 ChatGPT on your data

Secured and governed Open at every tier Insight to action AI driven insights
Power BI

Data Activator
Announcing

Data Activator
Introducing Data Activator
Teams

Power BI
Data Activator Outlook

Power
Synapse No-code user experiences Automate
Data Warehouse

Synapse Custom
Real Time Analytics Models Rules Triggers

… Real-time stream processing …


Examples

Greater than Crosses above Crosses above and Crosses above 3 times
a threshold a threshold stays above in an hour

“Alert me when the number “Alert an account manager “File a ticket upon any “File a ticket whenever any
of support calls exceeds 100 upon any invoice becoming freezer crossing and staying freezer exceeds 32F three
on any given day.” 10 days overdue.” above 32F for 15 minutes.” times in an hour.”

Decreases by 10 Decreases by 10% Rolling average Exceeds 1 standard


decreases by 10% deviation
“Alert the product manager “Alert the product manager
when revenue decreases by when revenue decreases by “Alert when the 6-hour “Alert when sensor readings
$10k in one day.” 10% in one day.” rolling average of sensor are is one stdev outside of
readings decreases by 10%.” 7-day rolling average.”
Microsoft Fabric
The data platform for the era of AI

Complete Lake centric Empower AI


Analytics and open Every Business Powered
Platform User

Everything, unified OneLake Familiar and intuitive Copilot accelerated

SaaS-ified One Copy Built into Microsoft 365 On your data


ChatGPT on your data

Secured and governed Open at every tier Insight to action AI driven insights
Copilot in
Microsoft Fabric
Thank you
Indblik i de nye
unikke features i
Microsoft Fabric
Peer Grønnerup
https://www.linkedin.com/in/peergroennerup/

peer.gronnerup@twoday.com
Microsoft Fabric – SaaS Analytics Platform

Data
Warehousing

Data Data
Engineering Science

Real-Time
Data Factory
Analytics

Data Activator Power BI

OneLake
44
Microsoft Fabric – Fokus på de nyeste unikke features

Real-time Analytics & Data Activator Semantic Link

Real-Time Data Activator Power BI Data Data Power BI


Analytics Science Engineering

OneLake

45
Real-Time Analytics i Fabric

Unikke features i Fabric Real-Time Analytics

Real-Time
Analytics • Fuld end-to-end understøttelse for ingest, transform og routing af
realtidsdata fra multiple kilder og i et hvilket som helst data format.
Data service optimeret for streaming og tidsseriedata • Eksekver analytiske forespørgsler direkte på rådata uden
og med overlegen performance i håndtering af transformation
strukturet, semi-strukturet og ustrukturet data. • Håndter strukturerede, semi-strukturerede og fritekst data
• Håndter ubegrænsede datamængder
Fuldt integreret i Fabric økosystemet.
• Fuldt integreret med andre Fabric experiences og items

Baseret på Kusto DB (Azure Data Explorer)


Items i Fabric Real-Time Analytics

• KQL Database – Database der anvender OneLake som storage


• KQL Queryset – For håndtering og deling af forespørgsler (KQL)
• Eventstream – Streaming data hub med multiple inputs og outputs

46
Real-Time Analytics i Fabric - Showcasen

• Custom App leverer sensor data fra


pools med temperaturmålinger
• Custom App er source i vores
Eventstream
• Som destination har vi defineret:
• Kusto DB
• Lakehouse
• Reflex

47
Live demo
Real-time Analytics
Data Activator

49
Data Activator - Showcasen

Data Activator

• Events routes til Reflex


• Reflex indeholder information om:
• Events
• Objekter
• Triggers
• Properties

50
Live demo
Data Activator
Semantic Link

Semantic Link

1 Adgang til Power BI semantiske modeller fra notebooks

2 Mulighed for at tilgå semantisk information i modeller

Direct Lake Apace Spark

52
Semantic Link

1 Adgang til Power BI semantiske modeller fra notebooks


Semantic Link

2 Mulighed for at tilgå semantisk information i modeller

Berigelse af data via ML workloads


Forvarmning af DirectLake datasets
Automatiseret dokumentation, validering og kvalitetssikring
Direct Lake Apace Spark

Export af data til (3. part, API’er m.m.)


Og et utal af andre scenarier….
Åbner for helt nye muligheder i samarbejdet om data

53
Semantic Link

1 Adgang til Power BI semantiske modeller fra notebooks

Data Power BI 2 Mulighed for at tilgå semantisk information i modeller


Engineers udviklere

Data BI
Scientists Analysts Berigelse af data via ML workloads
Forvarmning af DirectLake datasets
Automatiseret dokumentation, validering og kvalitetssikring
Export af data til (3. part, API’er m.m.)
Power BI Og et utal af andre scenarier….
Semantic Models
Åbner for helt nye muligheder i samarbejdet om data

54
Live demo
Semantic Link
Sådan udnytter du
det fulde potentiale i
Microsoft Fabric fra
start

56
Aktivering af Fabric

– Microsoft Fabric tenant admin-switch lader organisationer, der


bruger Power BI, vælge at bruge Microsoft Fabric.

– Microsoft Fabric bliver tilgængeligt for alle i tenanten eller for en


udvalgt gruppe af brugere.

– Der er også en switch til at aktivere prøveversion af Fabric med


60 dages betalte funktioner. Prøvekapaciteterne kan deles med
andre brugere.

– Microsoft Fabric kan også aktiveres på kapacitetsniveau.

57
Hvad koster det?

– Den officielle prisliste er nu tilgængelig

– Kapacitet er for Fabric, hvad CPU er for en PC

– Men hvor meget "juice" skal der til for at...

Få tonsvis af data igennem med Pipelines

Kør notebooks med Spark-motoren

Udfør Stored Procs i et Warehouse

Afvikle Power BI-forespørgsler til OneLake SKU pricing in DKK at West Europe.

58
Hvordan styrer man en kapacitet?

Altid toppræstation
• Bursting tildeler
automatisk ressourcer
efter behov for at udføre
med maksimal ydeevne.
• Samme mængde
arbejde, bare afsluttet
hurtigere.

59
Hvordan styrer man en kapacitet?

Eliminer “workload
management”
• Store job påvirker ikke
andre job, der kører
• Udglatning forhindrer
spikey-belastninger
• Køb kapacitet for
gennemsnits-belastning
– ikke max!

60
Hvordan styrer man en kapacitet?

Altid på auto-pilot
• Selvstyret system
• Fuldstændig
gennemsigtighed i
brugen på ethvert
niveau.

61
Udnyt de fordele et Data Lakehouse giver
– Dekobling af compute og storage
– Minimering af omkostninger
– Øget fleksibilitet
– Lettere samarbejde på tværs af projekter og afdelinger

– Udnyt mulighederne med PySpark


– Nem håndtering af semi-strukturerede data
– Automatiseret datarensning og validering
– Bedre framework med et programmeringssprog

– Eller udnyt at Dataflows er kommet i ny version

62
Vær bevidst om at det er en ny platform
– Der vil være børnesygdomme
– Stabilitet og modenhed er måske ikke helt der hvor vi ønsker os
– Microsoft arbejder hurtigt
– Men Databricks kan stadig være et bedre valg i nogle scenarier

– Start evt. med PoC


– Migrering af eksisterende DWH løsning
– Ny løsning

– Kom godt fra start


– Deltag på vores 2-dages Microsoft Fabric kursus

63
Spørgsmål
og konkurrence

64
Så’ der frokost!
Vi ses igen kl. 12.15:

Spor 1: Power BI Self-service demo -> lokale 1.06 (1. sal)


Spor 2: End-to-end Lakehouse demo - > bliv i dette lokale

You might also like