Professional Documents
Culture Documents
Mithun Snowflake
Mithun Snowflake
Data ware — repository fr storing and managing large volume of str and unstructed data
Snowflake is a cool cloud-based data warehousing platform to store and analyze data in very
productive way
Its utilizes a unique architecture called multi-cluster, shared data architecture (MPP)
which separates compute from storage, which makes scalable
Features
Snowflake is cloud-agnostic- it can support and run in most of all cloud platform like AWS
,Azure, Google Cloud which is selling point
virtual warehouses
data sharing
Snow flake : 1
Can intergrate with wide range if BItools ,data visuliization platforns and analytics
services for data analysis and reporting
Layers
Cloud infrastructure layer
Authentication
Infrastructure management
Metadata management
Access control
stroage layer
data is only accesble thru sql query operations which we run on snowflake
compute layer
query processing layer
XSMALL - 1 node
SMALL - 2 node
MEDIUM - 4 node
LARGE - 8 node
XLARGE - 16 node
security layer
meta-data layer
Snow flake : 2
Scaling
horizontal (in which we increase the size of ware house)
vertical (in which we increase the number of ware house via nodes )
Caching
metadata cache
warehouse cache
result cache
Semi-std - JSon,Xml,Orc,Avro,paraquet
Snow flake : 3