Data Storage Services in GCP: Relational Database Data Warehouse Nosql Big Data Database Service
Data Storage Services in GCP: Relational Database Data Warehouse Nosql Big Data Database Service
Cloud Dataproc Cloud SQL Bigtable BigQuery Cloud Datastore Cloud Storage Cloud Spanner
Managed Services
Serverless Services
Useful links
https://db-engines.com/en/system/Google+Cloud+Bigtable%3BGoogle+Cloud+Datastore
https://cloud.google.com/solutions/data-lifecycle-cloud-platform#processing_large-scale_data
https://cloud.google.com/solutions/data-lifecycle-cloud-platform#storing_object_data
Unstructured/
Structured Semi-Structured
Structured
Cloud Cloud
Cloud SQL Cloud BigQuery Cloud Store
DataStore BigTable
Spanner
Google Cloud Interconnect
Bandwidth
Yes <10 Gbps
No
public
Yes No
Internet
Local
Gateway
Compute
Cloud Storage
Shielded VMs
Quickly protect VMs against advanced threats.
Ensure workloads are trusted and verified.
Help protect secrets against exfiltration and replay.
Python Pandas
Ask Pandascommand
Distinct drop_duplicates(),unique(){togetcountnunique()}
Top 10 records from table head(10)
Show columns df.columns
Show index "df.index"
Group by groupby(‘columns’)[‘columns’].sum()
Give alias to agg filed reset_index(name='QTY_Sum’)
Order by sort_values('QTY_Sum',ascending=False)
Lower str.lower()
substring str.slice(1)/Split
Cast df['DataFrameColumn'].astype(int)
Get count of each record in the column value_counts()
Summarize the DataFrame describe()
Select df[['column1','column2']]
Get column and records count Df.shape
Select all columns except last 3 df.iloc[:,:-3]