English Learning
AWS SAA 学习笔记
无麸质饮食记录
15-Big Data
type
status
date
slug
summary
tags
category
icon
password
- 3 Vs
- volume
- variety
- velocity
- redshift
- data warehouse
- large relational db
- base postgreSQL
- info
- multi-az
- snapshots
- no conversions for az
- support 16pb data
- support s3
- EMR
- EC2
- elastic map reduce
- support hive spark 。。。
- ETL:
- extract
- transform
- load
- EMR storage
- HDFS
- Hadoop distributed file system
- EMR File System
- EMRFS
- local file system
- cluster and nodes
- primary node
- core node
- task node
- Architecture
- kinesis
- real time streaming data
- roles
- producers
- kinesis
- consumer
- kinesis data analytics
- vs sqs
- realtime ⇒ kinesis
- SQS - simpler
- Kinesis fassster and store data for up a year
- data streaming not auto scale, data firehose does
- amazon athena & aws glue
- athena: serverless sql solution
- query service
- glue: serverless data integration
- serverless ETL service
- Amazon QuickSight
- bi data visualization service
- column level security
- SPICE: in memory engine
- create a dashboard
- AWS Data Pipeline
- Extract Transform Load service
- automated workflows
- data driven
- Amazon Managed Streaming for Apache Kafka
- Amazon MSK
- manage data plane operations
- Amazon OpenSearch Service
- elastic search
Last update: 2024-04-04