Projects / Automated News Data Pipeline
About Automated News Data Pipeline
A production-ready data pipeline that scrapes top posts from r/ArtificialInteligence, loads them into Snowflake, and transforms them into analytics-ready tables using dbt -- all orchestrated by Apache Airflow and fully Dockerized for one-command deployment.
Skills and Technologies
PythonPosgreSQLSnowflakedbtApache AirflowDockerGitHub Actions