Download as pdf or txt
Download as pdf or txt
You are on page 1of 23

Oracle Big Data Preparation Cloud Service

Transform to Self Service Data Preparation for Business Users

Jernej Kase
Cloud & Digital Partner Programs,
Alliance & Channels Oracle EMEA

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | Oracle Confidential – Internal/Restricted/Highly Restricted
Safe Harbor Statement
The following is intended to outline our general product direction. It is intended for
information purposes only, and may not be incorporated into any contract. It is not a
commitment to deliver any material, code, or functionality, and should not be relied upon in
making purchasing decisions. The development, release, and timing of any features or
functionality described for Oracle’s products remains at the sole discretion of Oracle.

Copyright © 2014 Oracle and/or its affiliates. All rights reserved. |


“Big Data’s dirty little secret is that 90% of
time spent on a project is devoted to
preparing data… After all the preparation
work, there isn’t enough time left to do
sophisticated analytics on it…”
Source: Thomas Davenport - Wall Street Journal, 2014

In the past year, data preparation has


become indispensable due to its
overwhelming contribution to analyses
and decision support.
Source: Gartner (http://blogs.gartner.com/lakshmi-
randall/2015/05/11/whats-next-data-preparation/)

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 4


Companies are struggling to derive value
from big data initiatives…
Traditional methods
Enterprise
ETL & Data
Integration

Internet Enterprise
90% of time is MONTHS of effort
spent WRANGLING spent on each new Reporting
DATA dataset

Logs
Data Discovery
PROGRAMERS writing scripts
or complex ETL & Visualization

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 5


Oracle’s Solution: Big Data Preparation Cloud Service
Data Preparation
Any-Structured ETL  Designed for data domain
Business Data
Processing experts, not programmers
 Focused on cleansing,
Data enriching and transforming
Visualization unstructured business data
 Operationalize data flows into
Enterprise ETL or Business Intelligence
Reporting
Key Benefits
 Easy to get started with
Ingest from Sources Enrich Publish browser based application
 Better recommendations
engine combines machine
Apache Spark ML + Hadoop + Semantic Graph learning with semantic
technologies
 Integrated into Oracle Cloud

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 6


Load Data for Oracle Business Intelligence Cloud Service
Easy Publishing to Oracle BICS
 Pre-integrated RESTful service
connectivity
 Shared single sign on access
across Oracle Cloud
 Common operational support
BDP-CS BICS from OPC services

Key Benefits
 Self Service access from non
technical users
 Cloud based applications can
bypass need for extensive IT
support for on premise tech
 Accelerated Value by enabling
business users to quickly
ingest data for operational BI

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 7


Intuitive User Interface
Integrated Data Verification, Transformation, and Visualization

Interactive
Transform Script
Profile Metrics
Metadata and Data Views and
Visualizations
Knowledge Driven
Recommendations

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 8


Highly Differentiated from Other Data Prep Tools
Better Recommendations Engine
Spark  Only Data Prep/Wrangling tool
Machine to combing Natural Language
Semantics Learning
based Processing (Apache NLP) with
Knowledge Machine Learning (Spark ML)
Graph Natural
Language  Leverages Linked Open Data
Processing graph of domain knowledge

Key Benefits
Hadoop  More Efficient Mapping by
leveraging a more effective
recommendation service
Oracle  Higher Quality automation
Cloud
“gets it right” more often
 Data Enrichment leveraging
domain data for enriching
sparse data sets

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 9


Oracle Big Data Preparation Core Capabilities
• Unified solution to prepare
Ingest Enrich Publish unstructured data
• Import & Ingest • Profile • On Demand or
• Simple to use tooling designed
• Detect Schema • Annotate Scheduled
• Cleanse, Normalize • Data Classification • Source / Target for non-programmers
& De-duplicate • Semantic Definition
• Detect & Mask Enrichments • Restful APIs • Unique technology approach
Sensitive Data • Missing Data • Event Driven combines Machine Learning
Interpolation (ML) with Natural Language
Processing (NLP) engine

Govern and Monitor • Powered by Apache Spark,


Hadoop, and UIMA
• Dashboards • Reusable user policies • Security
• Automated Alerts • System Controls • Stats & Metrics • Cloud operated from the Oracle
Public Cloud

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 10


Big Data Preparation and Enrichment Examples
Supported Formats
Repair App Data
Invalid
emails Credit
Structured Invalid and missing data
Unreliable Sensitive data SSN Card
Info

Parse Click Stream Logs

Embedded information
Unstructured
No reliable patterns
High Velocity

Classify Social Data


Embedded information Entities
NLP in unstructured text
Unstructured
High Volume

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 11


What’s New

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 12


BDP 16.1.3
• Runtime Null Checks
– Allows users to define a runtime
threshold for number of nulls for a
column.
– System checks for nulls at runtime
and will throw error when user
defined threshold is violated
– Job details will give users information
about which columns violated DQ
checks

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 13


Custom Reference Knowledge
• Knowledge Import Feature
– Allows users to import custom
knowledge in seconds (CSV or TSV)
– Custom Knowledge used to enhance
Knowledge Service capabilities
• Knowledge Maintenance Page
– Allows users to manage reference
knowledge
• Create
• Rename
• Activate and Deactivate
• Delete Custom Knowledge
Knowledge Service
Import

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 14


Data Blending Assisted by Relationship Discovery
• Data Blending
– Empowers Business Analysts to Blend
Datasets from Multiple Sources, in any
format into a single enriched file ready
for downstream processes.
• Cross Source Relationship Discovery
– Assists Business Analysts by discovering
and recommending relations between
two datasets that can be used as blend
keys
– Powerful algorithm leverages deep
column profile and fingerprinting

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 15


New and Improved User Interface/User Experience

• New Home Screen


– Real-time Processing Metrics
– Quick Start Menu with Video Assist
– Quick Links to Documentation
• New Simplified Creation Flows
– Asynchronous Ingestion Process

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 16


!
Accelerate Lower Development Deeper Insights Reduce Risks
Analytics Costs with Trustworthy Data

Faster insights from No more custom Consistent, complete, Avoid costly and error
growing data sources coding trustworthy data prone data curation
efforts
Increased collaboration Less IT time spent Data driven
on data set creation decision making Data experts work
between business and IT directly on the data –
for data preparation Data experts curate Governed data preparation not through requirements
the data docs

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 17


Q&A

Jernej Kase

ISV Migration Center blog: http://blogs.oracle.com/imc


ISV Migration Center email: partner.imc@beehiveonline.oracle.com

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. |


Big Data Preparation Cloud Service
Demo

Jernej Kase
Cloud & Digital Partner Programs,
Alliance & Channels Oracle EMEA

Copyright © 2015, Oracle and/or its affiliates. All rights reserved. | 19


Getting started with Big Data Preparation Cloud Service

cloud.oracle.com/big-data-preparation

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 20


Copyright © 2016, Oracle and/or its affiliates. All rights reserved. | 21
Oracle Partner Hub ISV Migration Center
Oracle.com Partner Hub twitter.com/OracleIMC
Team Info, Events/Activities
Schedule, etc plus.google.com/+OracleIMC

Migration Center Team Blog facebook.com/OracleIMC


Webcasts, Howto, Demos, Guides, linkedin.com/groups/Oracle-Partner-Hub-Migration-
etc Center-4535240
Youtube: OracleIMCteam
feeds.feedburner.com/oracleimc
Slideshare: Oracle_IMC_team

Partner.IMC@beehiveonline.oracle.com

Copyright © 2016, Oracle and/or its affiliates. All rights reserved. |

You might also like