Download as pdf or txt
Download as pdf or txt
You are on page 1of 13

English

PoINT Archival Gateway


2 Archival Gateway Archival Gateway 3

Contents
4 Introduction

6 Product Overview
Key Features
7 Use Cases
S3 Archiving to Tape
Tiering / ILM for On-Prem Object Storage
Backup of Cloud/Object Data
9 Architecture and Terms
Object Repository (Bucket)
Archival Storage Partition
Archival Storage Device
11 Design and Concept
Interface Nodes
Database Nodes

Your data is Networks


14 Workflow
the key, so don‘t Writing Objects

lose it. Reading Objects


15 Configuration Options
16 Functionality
S3 REST API
Performance
Erasure Coding
Redundancy
Data and Management Security
Encryption
Retention Management
Lifecycle Management
Audit Logs
Admin GUI
21 Supported Archival Storage Devices
Tape Systems
4 Archival Gateway Archival Gateway 5

Introduction

Growth of unstructured data is one of as storage medium, like LTO and 3592, price ratio. In combination with its WORM available, tape technology has the potential
the biggest IT data center infrastructure provides a solution to this challenge. capabilities tape technology also fulfills for essential capacity improvements.
challenge for enterprises, according to Like file and block storage, object storage is archiving requirements. PoINT Archival Gateway is a software-
recent surveys of research firms. This a way to store data. The difference between Additionally, removable media like tape defined scale-out object storage system
challenge cannot be solved economically file, block and object storage is that object create an “air gap” against malware. Thus, designed to store and manage massive
and technically meaningful with file and storage has been designed for scalability, tape technology is perfectly suited for data amounts of data on tape libraries with
block storage (or NAS and SAN) based on cost-effectiveness, reliability and high protection and backup. outstanding performance. PoINT Archival
hard disk or flash technology. availability. In principle the way to store Gateway provides a standardized S3 REST
data is independent from the underlying For these reasons, an approach which API and supports a wide range of tape
Additionally, studies show that the majority storage media. E.g., in the same way as files combines object storage and tape libraries.
of unstructured data is inactive. This data can be stored on hard disk and tape in a file technology as storage medium provides
is not frequently used, but still needs to system structure, also objects can be stored an innovative and economic solution for In this technical white paper, a detailed
be preserved for business or compliance on different storage media. the data growth problem. Additionally, technical description of PoINT Archival
reasons. It does not make sense to store compared to all the storage media Gateway is provided.
huge amounts of inactive data on hard disk Considering the massive growth of
or flash based storage systems. unstructured data within the next years in
combination with the fact that majority of
Software-defined object storage offering this data becomes inactive very fast, tape
a standardized REST API, like Amazon is the only storage media which provides
S3, using standardized tape technology sufficient data capacity with an acceptable

DBN & IFN

Client Network PoINT Archival Gateway Tape Library


6 Archival Gateway Storage Manager 7

Archival
Gateway

PoINT Archival Gateway offers


standardized interfaces and protocols like
the Amazon S3 REST API which may be
used by the actual client applications for
storing and reading data on the supported
archival storage devices, drives and media.
Essential services provided by PoINT
Archival Gateway include user, data and
storage management as well as access
control, logging and monitoring.

Product Overview K E Y F E AT U R E S

• High performance in terms of data and


PoINT Archival Gateway is a high- object rates
performance, scale-out, software-based • High availability and reliability
object storage solution designed to • High scalability (incl. load-balancing,
manage massive amounts of data. redundancy and failover)
• Industry standards (S3 REST API, LTO
PoINT Archival Gateway is building a and 3592 tape drives)
bridge between client applications or • Object versioning
systems, respectively, and archival storage • Data protection (erasure coding, authen-
systems like tape library systems by tication and encryption)
providing services and functions which are • Self-monitoring, reporting and alerting
mandatory for data archival solutions in • User management based on domain
enterprise data centers. services (AD, LDAP)
8 Archival Gateway Archival Gateway 9

Use Cases
S3 A RC H I V I N G TO TA P E T I E R I N G / I L M F O R O N-P R E M O B J E CT S TO RAG E B AC K U P O F C LO U D / O B J E CT S TO RAG E

PoINT Archival Gateway provides a solution


for a wide range of requirements. With its PoINT Archival Gateway fulfils archiving and Public cloud storage providers like AWS Backup of cloud and object data becomes
standardized S3 REST API PoINT Archival compliance requirements thanks to WORM and Microsoft Azure offer different S3 more and more important. It has already
Gateway can be used by many S3-capable functionality and integrated retention storage classes with different performance happened that public cloud providers
applications. management. As a result, saved data is features to their customers. On premises disappeared and stored data was no
protected not just against unintended object store products which are used as longer accessible. Also, clouds are not
The following figure gives some examples deletion, but also against manipulation private cloud offer only one storage class invulnerable to malware attacks. The same
of applications and environments which (e.g. from ransomware attacks). which is typically hard disk based. This is applies to data on on-prem object stores.
can be integrated with and connected to inefficient because active and inactive data For this reason, backup of cloud and object
PoINT Archival Gateway. PoINT Archival Gateway provides persistent are stored on the same storage technology. data is essential.
data management for meeting legal and
business data archival requirements. The combination of hard disk-based object PoINT Archival Gateway allows to make
Appropriate retention rules can be enabled store with PoINT Archival Gateway allows backups from your cloud and object data to
and specified on object repository level. to tier inactive data from hard disk to tape in its native S3 form. Your object data
The rules define how and when existing tape. For this purpose, many object store will be saved in the same structure as your
objects may be modified or deleted. products provide already integrated ILM original data. This means saved data can be
functions. directly accessed through the S3 interface
of PoINT Archival Gateway. A long retrieval
PoINT Archival Gateway - Ecosystem process is not necessary.
O B J E C T S TO R A G E T I E R I N G D ATA M A N A G E M E N T

O B J E C T S TO R A G E R E P L I C AT I O N

P U B L I C C LO U D B A C K U P B I G D ATA R E P O S I TO RY


10 Archival Gateway Archival Gateway 11

Architecture and Terms Design and Concept

The figure below provides an overview as management units for the entire archival PoINT Archival Gateway leverages the of object records. In addition, databases
of the architecture and terms used in storage space available on all Archival object storage architecture and manages support arbitrary extension of existing
the scope of a PoINT Archival Gateway Storage Volumes which are available in data as objects. Each object includes a records as well as indexes and highly
installation. the Archival Storage Devices. Multiple unique identifier of the object, the data performant queries covering all existing
Archival Storage Partitions can be created, itself as well as a variable amount of custom records. These features and functions are
O B J E C T R E P O S I TO RY ( B U C K E T )
each of which having different settings metadata. This architecture basically allows essential for supporting various kinds and
PoINT Archival Gateway allows to and restrictions (e.g. size limitations, access scalable retention of massive amounts of versions of object storage interfaces like
create an arbitrary number of Object rights, levels of protection, redundancy or unstructured data on removable media like the S3 REST API.
Repositories (Buckets), each of which can availability) and using separate Archival tape media in libraries.
have different settings and restrictions Storage Volumes. By default, PoINT Archival This separation of data and metadata is
(e.g. size limitations, access rights). Client Gateway automatically extents the size of PoINT Archival Gateway explicitly separates reflected by the design of PoINT Archival
applications and systems can store and Archival Storage Partitions by allocating data and metadata. Unique identifiers and Gateway which requires assigning the
read objects in the Object Repositories by additional Protected Volume Arrays (see metadata are stored in common databases, services to separate and dedicated server
using the S3 REST API of PoINT Archival below) when the lastly allocated array has while data are stored on tape media. Copies systems. Consequently, the PoINT Archival
Gateway. Object Repositories of the same completely been filled with data. of the metadata and unique identifiers are Gateway software consists of two software
Archival Storage Partition share the same stored on storage media, too, to support packages, each of which may be installed
A RC H I VA L S TO R A G E D E V I C E
Protected Volume Arrays. disaster recovery and verification processes. on a dedicated server system. A dedicated
Archival Storage Devices are media changer server system including its basic operating
A RC H I VA L S TO R A G E PA RT I T I O N
devices (tape libraries or tape loaders) with Object storage architectures typically use system software and the software package
PoINT Archival Gateway uses a logical a specific number of integrated tape drives. databases to store metadata and unique is called a gateway node. Defined by the
construct called Archival Storage Partition identifiers. In contrast to other architectures installed software package, a gateway
(e.g. file systems and their hierarchical node serves exactly one of two possible
S3 S3 S3 structures), databases do not impose purposes.
any kind of limit on character sequences
Object Repository ( Bucket) which can be used in unique identifiers
Archival Storage Partition
and custom metadata or on the number
- Space provided by PVAs Partition
- Expands automatically

Protected Volume Array (PVA)


- 1-4 volumes PVA PVA
- Protected by Erasure Codes

Archival Storage Volume


- Tape media

Archival Storage Device


- Group of up to 8 tape loaders
or library units
12 Archival Gateway Archival Gateway 13

INTERFACE NODES

A gateway interface node (IFN) primarily the system configuration (Admin GUI) and The following picture illustrates the networks used by the solution by showing
is the communication partner for client the management and control modules for design of PoINT Archival Gateway and the a sample deployment.
applications or systems. By means of Archival Storage Devices (tape libraries),
dedicated interface modules, it provides an which therefore are also located on this
S3 REST API for storing and reading objects node. Interface Nodes Database Nodes

to client applications and systems and


Tape Library
transfers data between client applications PoINT Archival Gateway requires at least
or systems and the Archival Storage Volume one working IFN and one working DBN to
Client
(i.e. tape media). be operable.

In detail, the IFNs provide the following PoINT Archival Gateway fulfills the highest
modules and services: levels of performance, availability and
Client
• HTTP service module (i.e. S3 REST API) scalability requirements by means of
• Data buffering module this fully scalable and redundant design, Tape Library
• Data encoding module (e.g. erasure providing scalable performance and
coding, hashing, encryption) redundancy levels for both service and Client
• Driver module for tape drives data entities. Because scalability and
• Metadata caching module (e.g. caching of availability of a single node is restricted
object metadata and configuration data) due to limitations of server hardware and
• Communication module for metadata operating systems, PoINT Archival Gateway
Archival Gateway
exchange with database node supports installing multiple DBNs and IFNs Storage Network
Client
Archival Gateway
in a single deployment in order to build Network
Cluster Network
(internal)
D ATA B A S E N O D E S (external)
clusters for increasing performance (i.e. load (internal)

A gateway database node (DBN) primarily balancing) and availability (i.e. failover and
provides central database services to the redundancy).
IFNs. The database holds the index of the
objects which have been stored e.g. object PoINT Archival Gateway automatically
NETWORKS
keys and metadata, storage locations of performs all tasks which are required to
object data on the Archival Storage Volume reconstitute operability and consistency of The storage network of the PoINT Archival
(tape medium), as well as configuration and a cluster node which had been affected by Gateway can be an iSCSI or Fibre Channel
maintenance data of PoINT Archival Gateway. a failure before, of course provided that the network connecting Archival Storage
Additionally, the database stores data from system administrator has re-established the Devices (e.g. tape libraries with tape drives)
logging and monitoring processes and operability of the server system, operating to the gateway nodes, while typically the
provides corresponding auditing services system and installed PoINT Archival other networks are Ethernet networks.
and log files. Further central services are Gateway software package before.
14 Archival Gateway Archival Gateway 15

Workflow Configuration Options

Clients connect to the interface modules sends a specific error response if the hash Typically, the software packages and thus However, for systems which are not
of the IFNs, which currently is the HTTP codes do not match. Then, it forwards the the services of PoINT Archival Gateway required to provide the maximum levels, a
service module providing the S3 REST API. request and its associated data buffers to should be installed on separate server special software package allows installing
Clients can send their requests to any IFN, the driver module for the tape drive to systems because this kind of deployments one database and one interface service on
and the IFNs either process the request write it to the tape media. In this module, provides the maximum levels of scalability, the same server system. In this case, the
and send the final response, or they the original data buffers are reorganized, availability and performance. database and interface service functions
instruct the clients to send the requests interleaved with data buffers from other are combined in a single and compact
to another IFN by sending a special objects, protected by adding erasure service module to optimize the common
redirection response. codes and logical block protection codes, use of resources and to remove overhead
optionally encrypted, and written to the caused by inter-service communication.
Redirection responses can occur if the Archival Storage Volume (tape media).
original request transfers object data to or Finally, after the data have successfully
from tape media (e.g. GET or PUT requests). been written to the Archival Storage
In this case, the appropriate Archival Volume, the HTTP service module stores
Storage Volume (i.e. tape medium) has the object metadata and the location
to be loaded in a free tape drive, and the of the object data on Archival Storage
request typically has to be redirected to Volume in the database and sends a
the specific IFN that is connected to this positive response to the clients.
tape drive because this IFN can read or
write the data.
READING OBJECTS

For reading an existing object, a client DBN & IFN


WRITING OBJECTS
sends an appropriate GET request.
When a client desires to write a new Provided that redirection is not required, Client Network PoINT Archival Gateway Tape Library
object, it sends an appropriate PUT request the HTTP service module of the addressed
including the object data. Provided that IFN first reads the object metadata and
redirection is not required, the HTTP the location of the object data from the
service module of the addressed IFN database. Then, it forwards the request
receives the object data and stores it in to the driver module for the drive units
data buffers in memory (q.v. data buffering to read the object data from the Archival
module). In addition, the IFN computes a Storage Volume (tape medium). This
hash code of the data and adds it to the module regenerates the object data from
metadata of the object. Optionally, the IFN the data on the Archival Storage Volume,
compares its hash code to a hash code copies it to data buffers in memory, and
which has been computed and transferred immediately sends the data buffers to the
by the client in the request header, and client.
16 Archival Gateway Archival Gateway 17

Functionality
S3 R E S T A P I PERFORMANCE ERASURE CODING

The PoINT Archival Gateway offers a The PoINT Archival Gateway is a software- Data security is ensured by an erasure using two, three or four Archival Storage
standardized S3 REST API. This makes the based, high-performance object storage coding process. This saves data blocks Devices (tape libraries) in parallel. A
software suitable for use with the rapidly system. It supports mass storage media redundantly on multiple media, meaning Protected Volume Array consisting of N
rising number of applications that support such as tape libraries. This combination that data is not lost even if a medium fails. tape media may span N libraries.
object-based storage using S3 REST. makes it possible to store and archive
hundreds of petabytes of data. The decisive Erasure code rates supported by PoINT In the following two figures the write
The PoINT Archival Gateway‘s design is factor that makes this possible is the Archival Gateway are 1/2, 1/3, 1/4, 2/3, path and read path are illustrated in case
independent of the storage technology way the PoINT Archival Gateway quickly 2/4 and 3/4. In combination with erasure a erasure coding rate of 3/4 has been
or system used, meaning that users can receives data and securely writes it to coding, the level of data protection and selected.
choose and swap out such systems with Archival Storage Volume (tape media) in redundancy can further be advanced by
confidence. The PoINT Archival Gateway‘s a format that means this data can also be Tape Drives
64K Segment Size
highly scalable S3 REST web service quickly read again afterwards.

{
enables almost unlimited parallelization S1 S1
and very high data transfer rates. The PoINT Archival Gateway’s high level S1
of scalability means it can handle transfer

Object Stream
S2

Application
Write Segmentation Encoding Write
rates of over 1 PB per day. S2
S2

S3 PUT Tape
S3
S3 S3
EC

EC

Write Path

Tape Drives

S1
S1

Object Stream
Read Reassembling Read

Application
S2
S2
S3 GET
S3 S3

EC

Read Path
18 Archival Gateway Archival Gateway 19

R E D U N D A N CY

Redundant server nodes automatically respectively, defines how principals must Once enabled, retention rules cannot actions on objects and incomplete multi-
managed by PoINT Archival Gateway mean authenticate themselves by providing their be disabled again, and affected object part uploads. Expiration policies cannot be
that the server systems running the PoINT credentials. repositories cannot be deleted before the filtered, but can only be applied to an entire
Archival Gateway remain stable. retention periods of all contained objects bucket. Policies defining transition actions
has expired. However, retention periods can are ignored by PoINT Archival Gateway.
E N C RY P T I O N
be extended at any time and, in addition,
D ATA A N D M A N A G E M E N T S E C U R I T Y
Data and metadata of objects may administrative deletion of retained objects PoINT Archival Gateway removes expired
The system’s approach to management optionally be encrypted by PoINT Archival is possible, but PoINT Archival Gateway objects and aborts incomplete multipart
security is made possible by user and Gateway before they are written to the unconditionally logs administrative uploads one day after the specified date
permission management concept. Access Archival Storage Volumes (tape media). deletion of objects. has been reached. This process runs once
rights can be assigned for these guidelines. Encryption leverages cipher blocks per day and automatically starts at the end
To this end, the administrative interface chaining algorithms, is applied on block of a day (i.e. at midnight local time).
L I F E CYC L E M A N A G E M E N T
provides functionality for creating local level and can be enabled or disabled
security guidelines for users and groups. on Archival Storage Partition level. For In addition to data retention management Expiration actions cannot delete objects
The system can also inherit external encryption, either a common system key as described above, PoINT Archival Gateway of which the retention period has not yet
guidelines from an Active Directory domain. or a partition specific key may be used. The provides a set of lifecycle management elapsed. In such a case, the expiration
encryption algorithms supported by PoINT functions of Amazon S3. action is performed at earliest as soon as
Access rights for the objects in an Object Archival Gateway use cryptographic keys of the retention period has been elapsed.
Repository are granted by adding a local 128 (AES-1) and 256 (AES-2) bits length. In particular, PoINT Archival Gateway
principal of type user or an external supports policies defining expiration
security principal to the list of authorized
RETENTION MANAGEMENT
object repository principals. While adding
a principal, a security manager may define PoINT Archival Gateway provides persistent
granular access rights by specifying if data management for meeting legal and
the principal shall be allowed to a list, business data archival requirements.
read, write or delete objects in the object Appropriate retention rules can be enabled
repository. and specified on Object Repository level
and apply to all objects in the repository.
Principals created and used for providing The rules define how and when existing
access rights to object repositories may, but objects in the object repository may be
do not need to have management roles. modified or deleted.

Access to the objects in object repositories The retention rules basically allow
is exclusively possible through the specifying retention periods and to issue or
client interfaces and protocols, where retire legal holds and can be managed by
the individual interface or protocol, means of the system configuration GUI.
20 Archival Gateway Archival Gateway 21

Supported Archival
A U D I T LO G S Storage Devices
PoINT Archival Gateway supports data In addition, all logon actions of managers
TA P E S Y S T E M S
access audit logs and security audit logs. and all modifications applied by security The PoINT Archival Gateway supports a
managers or either way related to security wide range of tape libraries and PoINT The following table provides an overview
Accesses to the data objects of an Object settings are logged in a protected security works closely with leading manufacturers. about the currently supported tape systems
Repository are logged in associated audit log file. This log file includes As a result, the list of supported products is (loaders and libraries):
access audit log files. Log records include information about the principals, including being expanded all the time.
identifier, time stamp and kind of access details about the modification.
Vendor Product
as well as an identifier of the principal who
performed the access. actidata actilib Library 2U

BDT FlexStor II

Cristie GigaStreamT8 GigaStream T24

GigaStreamT48 GigaStream T24

HPE MSL2024 MSL8096

StoreEver MSL6480

IBM TS3100 Tape Library TS3200 Tape Library

TS3500 Tape Library TS4300 Tape Library

TS4500 Tape Library

Qualstar RLS-8560 RLS-85120

XLS-832700
ADMIN GUI
Quantum Scalar i3 Scalar i6
The primary configuration interface is the module forwards configuration data and
Admin GUI provided by an HTTP service of corresponding changes to the affected Scalar i6000

PoINT Archival Gateway. nodes.


PoINT Archival Gateway supports LTO and PoINT Archival Gateway directly supports
The service is hosted by Microsoft Internet 3592 tape drives. and integrates tape libraries. No additional
Information Server instances running on drivers or software products are required.
the DBNs. The Admin GUI communicates Please contact PoINT Software & Systems
with the configuration module via a local GmbH for support of tapes systems that A PoINT Archival Gateway installation
network connection. The configuration are not yet listed. supports up to 8 tape libraries with a
maximum number of 256 tape drives.
22 Archival Gateway Archival Gateway 23

PoINT Software &


Systems

PoINT Software & Systems is specialized in PoINT products are distributed in more than
the development of software solutions for 25 countries world-wide and have been
storage and management of data using all installed successfully in more than two
available mass storage technologies like million installations. Our customers range
cloud and object storage, hard disk, magnetic from end users expecting a compact and
tape and optical. Close collaboration with secure solution to large corporations, which
leading hardware manufacturers enables comply with our solutions their complex
an early support of innovative storage demands by providing the necessary
technologies. Besides complete solutions reliability and perfection.
PoINT also offers its know-how as Toolkits,
which can be easily integrated in other
applications by the programming interface.
Furthermore we project entire storage
solutions and provide consultancy with our
long-term and versatile experience.
PoINT Software & Systems GmbH P +49 271 3841-0
Eiserfelder Straße 316 M info@point.de
57080 Siegen, Germany W www.point.de

You might also like