
Eight Storage Requirements for AI and Deep Learning


Data is the lifeblood of artificial intelligence and deep learning
(AI and DL). Vast quantities of training data enhance accuracy in
the search for potentially predictive relationships.

Here are eight specific storage requirements of AI and DL applications and why they demand
the data management capabilities supplied by enterprise object storage solutions.

1. SCALABILITY

Artificial intelligence systems can process vast amounts of data in a short timeframe, an
essential attribute since large data sets are required to deliver accurate algorithms. This
data volume drives significant storage demands. Microsoft, for example, required five years
of continuous speech data to teach computers to talk. Tesla is teaching cars to drive with
1.3 billion miles of driving data. Managing these data sets requires a storage system that
can scale without limits.

HOW CLOUDIAN HELPS
Object storage is the only storage type that scales limitlessly within a single namespace.
Plus, the modular design allows capacity to be added at any time. You can scale with demand,
rather than ahead of demand.

2. COST EFFICIENCY

A useful storage system must be both scalable and affordable, two attributes that don’t
always co-exist in enterprise storage. Historically, highly-scalable systems have been more
expensive on a cost/capacity basis. Large AI data sets are not feasible if they break the
storage budget.

HOW CLOUDIAN HELPS
Object storage is built on the industry’s lowest cost hardware platform. Combine that with
low management overhead and space-saving data compression features, and the result is 70%
less cost than traditional enterprise disk storage.

3. SOFTWARE-DEFINED STORAGE OPTIONS

Vast data sets will sometimes require hyperscale data centers with purpose-built server
architectures already in place. Other deployments may benefit from the simplicity of
pre-configured appliances.

HOW CLOUDIAN HELPS
Object storage keeps your deployment options open, with your choice of storage appliances
or software-defined storage.

4. HYBRID ARCHITECTURE

Different data types have varying performance requirements, and the hardware must reflect
that. Systems must include the right mix of storage technologies to meet the simultaneous
needs for scale and performance, rather than a homogeneous approach that will fall short.

HOW CLOUDIAN HELPS
Object storage employs a hybrid architecture, with spinning disk for user data and SSDs for
performance-sensitive metadata, thus optimizing cost and performance.

5. PARALLEL ARCHITECTURE

For data sets that grow without limits, a parallel-access architecture is essential.
Otherwise, the system will develop choke points that limit growth.

HOW CLOUDIAN HELPS
Object storage employs a shared-nothing cluster architecture, which means that all parts of
the system work in parallel. Data throughput grows continuously as the system expands.

6. DATA DURABILITY

Backing up a multi-petabyte training data set is not always feasible; it would often be cost
and time prohibitive. But you can’t leave it unprotected either. Instead, the storage system
needs to be self-protecting.

HOW CLOUDIAN HELPS
Object storage is designed with redundancy built-in, so data is protected without requiring
a separate backup process. Furthermore, you can select the level of data protection needed
for each data type to optimize efficiency. Systems can be configured to tolerate multiple
node failures, or even the loss of an entire data center.

7. DATA LOCALITY

While some AI/DL data will reside in the cloud, much of it will remain in the data center
for a variety of reasons: performance, cost, and regulatory compliance are three of them.
To be competitive, on-prem storage must offer the same cost and scalability benefits as its
cloud-based counterpart.

HOW CLOUDIAN HELPS
Object storage is the storage of the cloud. In fact, Cloudian® supplies object storage
solutions to many cloud providers for use as public cloud infrastructure. The scalability
and economics of cloud storage are now available to you on-prem.

8. CLOUD INTEGRATION

Regardless of where data resides, integration with the public cloud will still be an
important requirement for two reasons. First, much of the AI/DL innovation is occurring in
the cloud. On-prem systems that are cloud-integrated will provide the greatest flexibility
to leverage cloud-native tools. Second, we are likely to see a fluid flow of data to/from
the cloud as information is generated and analyzed. An on-prem solution should simplify
that flow, not limit it.

HOW CLOUDIAN HELPS
Cloudian is cloud-integrated in three ways. First, it employs the S3 API, the de-facto
standard language of cloud storage. Second, it facilitates tiering to Amazon, Google, and
Microsoft public clouds, and lets you view local and cloud-based data within a single
namespace. Third, data stored to the cloud from Cloudian is directly accessible by
cloud-based applications. This bi-modal access lets you employ both cloud and on-prem
resources interchangeably.

Realizing the full potential of AI/DL requires an infrastructure that supports innovation.
Cloudian enterprise object storage delivers the scalability, cost efficiency, and
interoperability that enhances the capabilities of these emerging technologies.

Cloudian, Inc.
177 Bovet Road, Suite 450
San Mateo, CA 94402
Tel: 1.650.227.2380
Email: info@cloudian.com
Web: cloudian.com
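
The Data Durability requirement says the protection level can be chosen per data type. The sketch below makes that trade-off concrete. It is illustrative only: the `ProtectionPolicy` class and the `replication` and `erasure_coding` helpers are hypothetical names, not a Cloudian API, and the brochure does not say which schemes are used. Replication and k+m erasure coding are assumed here as two common options, with every fragment placed on a different node.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProtectionPolicy:
    """Hypothetical per-data-type protection policy (illustrative, not a vendor API)."""
    name: str
    data_fragments: int       # fragments needed to read an object back
    redundant_fragments: int  # extra full copies or parity fragments

    @property
    def node_failures_tolerated(self) -> int:
        # Assumes every fragment is stored on a different node.
        return self.redundant_fragments

    @property
    def raw_bytes_per_usable_byte(self) -> float:
        total = self.data_fragments + self.redundant_fragments
        return total / self.data_fragments


def replication(copies: int) -> ProtectionPolicy:
    # N full copies: one is needed to read, the other N - 1 are redundant.
    return ProtectionPolicy(f"{copies}x replication", 1, copies - 1)


def erasure_coding(k: int, m: int) -> ProtectionPolicy:
    # k data fragments plus m parity fragments; assumes a code in which
    # any k of the k + m fragments are enough to rebuild the object.
    return ProtectionPolicy(f"EC {k}+{m}", k, m)


if __name__ == "__main__":
    for policy in (replication(3), erasure_coding(4, 2)):
        print(policy.name,
              policy.node_failures_tolerated,
              policy.raw_bytes_per_usable_byte)
```

Under these assumptions, both example policies survive two node failures, but the erasure-coded one consumes half as much raw capacity per usable byte. That is the kind of efficiency choice the section describes.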

© 2018 Cloudian, Inc. Cloudian, the Cloudian logo, and HyperStore are registered trademarks or
trademarks of Cloudian, Inc. All other trademarks are property of their respective holders.
TOP-AIML-018
