GitHub Actions: Configure OpenID Connect [OIDC] Provider in AWS | Long-term credentials, such as IAM (Identity and Access Management) access keys and secret keys in AWS, can pose significant security risks… | Feb 7
GCP: Create your Dataproc Cluster with Spark and Dataproc Metastore Hive | What is Google Cloud Dataproc? | Feb 4
GCP Data Governance: Column Level Security Best Practices — Taxonomies, Data Class, Policies, and… | Data Governance and Access Control encompasses managing data and its authorized usage, ensuring accuracy, protecting sensitive information… | Jan 26
Data Discovery in GCP: Dataplex Tags, Tag Templates, Entry Groups and Entries | Being data-driven is crucial for business, and efficient data discovery is crucial for business success. As data stewards and… | Jan 16
Dataproc Metastore: Create A Fully Managed Hive Metastore on GCP | Dataproc Metastore is a fully managed, highly available, autohealing, serverless, Apache Hive metastore (HMS) that runs on Google Cloud. | Jan 8
GitHub Actions 101: Automate Your GitHub Projects: Getting Started | GitHub Actions is a powerful automation tool for building, testing, and deploying your code directly from your GitHub repository. | Nov 2, 2023
Spark-Scala-Gradle-Bootstrap: A Spark Project in Scala with Gradle | TL;DR: Setting up a development environment for Spark, Scala, Gradle, Delta Lake, unit tests, and Scala code coverage can be… | Oct 23, 2023
A Generic Kafka, KStream Producer Consumer SDK. Underrated!!! | 🚀 Calling all Kafka Enthusiasts! 🚀 | Oct 5, 2023