We are looking for a Data Engineer (mid-level profile) to join a social impact organization focused on improving educational opportunities, emotional well-being, and personal development for people in vulnerable contexts.
In this project, data is not used to increase sales or optimize a digital product. It is used to demonstrate real social impact, evaluate programs, and make the case to public administrations for models that work. Data is a tool for social transformation.
Turn data into evidence that helps improve and scale social projects
Bring rigor and traceability to models impacting education, health, and community development
Participate in building the data foundation that will support the 2027–2030 strategic plan
Evolve a Data Lake on Azure + Databricks, currently at an early stage
Move from raw data to a Medallion Architecture (Bronze / Silver / Gold)
Integrate multiple data sources from very different contexts
Ensure data quality, consistency, and reliability
Work with well-defined indicators that are not yet used to their full potential
Azure infrastructure
Databricks as the core processing platform
Development mainly in Scala, plus PySpark and SQL
Salesforce as a cross-cutting tool used across the organization
Data sources to integrate: internal tools, Excel, Typeform, Moodle, Sage, Factorial, among others
Growing ecosystem, with support from an external consultancy
Startup mindset: pragmatic and focused on building
Required
Experience as a Data Engineer in cloud environments (Azure)
Databricks
Nice to have
Scala and/or PySpark
Strong SQL
Experience designing and maintaining ETL / ELT pipelines
Very important
Autonomy and ownership
Ability to understand the business and translate it into data solutions
Good communication skills
Continuous improvement and builder mindset