This document discusses big data strategies and tools for analyzing big data using R. It describes how a big data solution requires technologies for data sources, integration, storage, data modeling, analytics, and visualization. It then explains Porter's model of generic business strategies for achieving competitive advantage through lower costs or product differentiation. Finally, it discusses how R can be used to design dashboards, do forecasting, machine learning, experiment design, text analysis, and data visualization for exploring big data using packages like ORCH and RHIPE that interface with Hadoop.
This document discusses big data strategies and tools for analyzing big data using R. It describes how a big data solution requires technologies for data sources, integration, storage, data modeling, analytics, and visualization. It then explains Porter's model of generic business strategies for achieving competitive advantage through lower costs or product differentiation. Finally, it discusses how R can be used to design dashboards, do forecasting, machine learning, experiment design, text analysis, and data visualization for exploring big data using packages like ORCH and RHIPE that interface with Hadoop.
This document discusses big data strategies and tools for analyzing big data using R. It describes how a big data solution requires technologies for data sources, integration, storage, data modeling, analytics, and visualization. It then explains Porter's model of generic business strategies for achieving competitive advantage through lower costs or product differentiation. Finally, it discusses how R can be used to design dashboards, do forecasting, machine learning, experiment design, text analysis, and data visualization for exploring big data using packages like ORCH and RHIPE that interface with Hadoop.
tools which range from technologies dealing with data sources, integration and data stores, to technologies which help with the creation of data models, presenting these through visualization and reporting. Big Data Solution • Data Sources • Integration and Data Storage • Data models and analytics • Visualization and Reporting Application of Big Data Porter's model Porter's model • Porter suggested four "generic" business strategies that could be adopted in order to gain competitive advantage. • The key strategic challenge for most businesses is to find a way of achieving a sustainable competitive advantage over the other competing products and firms in a market. • A competitive advantage is an advantage over competitors gained by offering consumers greater value, either by means of lower prices or by providing greater benefits and service that justifies higher prices. Porter's model • The differentiation and cost leadership strategies seek competitive advantage in a broad range of market or industry segments. • By contrast, the differentiation focus and cost focus strategies are adopted in a narrow market or industry. • Cost Leadership : With this strategy, the objective is to become the lowest-cost producer in the industry. • Cost Focus : Here a business seeks a lower-cost advantage in just one or a small number of market segments. • Differentiation Focus : In the differentiation focus strategy, a business aims to differentiate within just one or a small number of target market segments. • Differentiation Leadership : With differentiation leadership, the business targets much larger markets and aims to achieve competitive advantage through differentiation across the whole of an industry. Unit V Exploring machine learning tool with Big Data Exploring Big Data with R
R is a programming language and free software
environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis. What can you do in R? • Design interactive dashboards • Forcasting • Train machine learning models • Experiment design • Visualize and publish data insight • Make prediction • Text analysis • Predict customer R-packages • The ORCH package ORCH stands for ‘Oracle R connector for Hadoop’. It consists of several packages that provide access to a Hadoop cluster, to achieve manipulation of HDFS resident data and the execution of MapReduce jobs. • The RHIPE package • RHIPE is another package that provides an API to use Hadoop with R for Big Data analytics. It is more integrated with plots. Thank You!!!!