Professional Documents
Culture Documents
Referensi Day 2
Referensi Day 2
Referensi Day 2
2. Bytes: Tipe data yang merepresentasikan urutan byte atau data biner.
Biasanya digunakan untuk menyimpan data gambar, file, atau data
terkompresi. Contoh: 0x48656C6C6F (representasi heksadesimal dari
string "Hello").
FORMAT DATA
json
{"id": 1, "name": "John Doe", "age": 25}
{"id": 2, "name": "Jane Smith", "age": 30}
{"id": 3, "name": "Mike Johnson", "age": 35}
avro
{"type": "record",
"name": "Person",
"fields": [
{"name": "id", "type": "int"},
{"name": "name", "type": "string"},
{"name": "age", "type": "int"}
]}
{"id": 1, "name": "John Doe", "age": 25}
{"id": 2, "name": "Jane Smith", "age": 30}
{"id": 3, "name": "Mike Johnson", "age": 35}
PARQUET
[id: 1, name: "John Doe", age: 25]
[id: 2, name: "Jane Smith", age: 30]
[id: 3, name: "Mike Johnson", age: 35]
ORC
[id: 1, name: "John Doe", age: 25]
[id: 2, name: "Jane Smith", age: 30]
[id: 3, name: "Mike Johnson", age: 35]
Perlu diingat bahwa Parquet dan ORC merupakan format data biner
yang biasanya digunakan dalam lingkungan big data untuk efisiensi
penyimpanan dan pemrosesan.