Rocío Baigorria

Data Engineer | AWS | Kafka | SQL | Python

I build scalable batch and real-time data platforms on AWS, focused on reliability, performance, and cost efficiency.

US Citizen | Open to Remote Roles and Relocation to the United States

Co-Leader at AWS Girls Argentina User Group | Community Speaker across LATAM


Core Expertise

  • AWS Data Platforms: S3, Glue, Athena, Redshift, Lambda
  • Batch & Streaming Pipelines
  • Apache Kafka / Event-Driven Architectures
  • SQL Data Modeling (Star Schema, Fact & Dimension Tables)
  • Python for ETL, automation, and data workflows
  • Terraform / Infrastructure as Code
  • Monitoring, Observability, and Cost Optimization

Community Leadership & Speaking

  • Co-Leader, AWS Girls Argentina User Group
  • Speaker, Data Wizard (Peru)
  • Speaker, AWS User Group La Paz
  • Upcoming Speaker, ACMUD Bogotá and AWS User Group Arequipa (May 15)

I enjoy helping engineers grow through practical talks on cloud, data platforms, and real-world architecture.


Featured Projects

Ecommerce Data Warehouse

Analytical platform on Redshift Serverless with incremental ingestion, star schema modeling, and Terraform-managed infrastructure.

Stack: Redshift, S3, SQL, Terraform

🔗 https://github.com/tuni56/ecommerce-data-warehouse-redshift


Real-Time Event-Driven Data Pipeline

Kafka-based streaming architecture designed for resilient ingestion, schema evolution, and observability.

Stack: Kafka, Python, Redis, Grafana, Terraform

🔗 https://github.com/tuni56/real-time-event-driven-data-pipeline


Serverless Data Lake Platform

AWS-native data lake with raw, processed, and curated layers using S3, Glue Catalog, and Athena.

Stack: S3, Glue, Athena, Python, CloudFormation

🔗 https://github.com/tuni56/serverless-aws-data-lake-with-kiro


AWS Serverless Cost Dashboard

Automated pipeline for AWS Cost & Usage Reports with near real-time visibility and alerting.

Stack: Lambda, S3, CloudWatch, SNS, Python

🔗 https://github.com/tuni56/AWS-Cost-Dashboard-Serverless-


Certifications & Professional Development

  • Confluent Data Streaming Engineering Foundations
  • AWS re:Invent All Builders Welcome Grantee
  • Migration and Modernization on AWS
  • AWS AI Practitioner Program
  • McKinsey Forward Program (Leadership, Problem Solving & Business Communication)

Connect



Pinned

  1. ecommerce-streaming-data-platform (Public)

    Real-time ecommerce streaming data platform using Kafka, AWS Route 53 routing, event-driven architecture, and observability with Grafana.

    Python

  2. serverless-aws-data-lake-with-kiro (Public)

    Cost-optimized serverless AWS data lake using S3, Glue, Athena, CloudFormation, and Kiro. Raw/curated architecture, Parquet, automated crawlers, and zero-idle compute.

    Python 3

  3. iot-data-architecture-aws (Public)

    Cost-effective AWS architecture for ingesting, storing, and querying 5 years of IoT sensor data using a serverless data lake approach.


  4. AWS-Cost-Dashboard-Serverless- (Public)

    AWS Cost Dashboard Serverless

    Python

  5. real-time-event-driven-data-pipeline (Public)

    Real-time event streaming pipeline with Kafka, Schema Registry, Kafka Streams, and production monitoring. Demonstrates advanced data engineering patterns at scale.

    Java

  6. datalake-analytics-pipeline (Public)

    Python