About the Job
We are seeking a detail-oriented and proactive Data Engineer to join our team as a contractor, with the potential to transition into a full-time role based on performance and available funding. This position will play a key role in developing scalable and reliable data infrastructure, including the construction of simulation pipelines that model millions of agents at real-time scales.
This role is ideal for candidates with strong Python and SQL skills, a solid foundation in data engineering best practices, and a passion for building reliable systems to support analytical and modeling workflows. The ideal candidate is also passionate about making an impact by solving complex problems related to the energy system and its role in climate change.
As a Data Engineer, you will be responsible for gathering and cleaning data from a wide variety of sources, designing efficient data pipelines, managing cloud-based data storage, and ensuring data quality across workflows. This is a high-impact role directly supporting the development of cutting-edge simulations that provide guidance to decision-makers.
This role requires that you both live in the United States and have authorization to work in the United States.
About us:
You will be joining Macrocosm, an early-stage startup that is using complexity economics to revolutionize economic modeling. Our purpose is simple: better economics for a better world.
We are building the world's only empirically validated digital twin of the global economy, and our initial focus is on the energy system. You will be creating a robust data pipeline for a product platform that provides foresight into a number of key economic indicators (e.g., energy prices, energy supply, firms’ profits, technology costs, GDP, inflation, and unemployment). This decision platform will be used to guide decisions for a range of clients, including asset managers, energy firms, institutional investors, policymakers, central banks, and others.
Key Responsibilities:
Design, build, and maintain scalable data pipelines for ingesting, transforming, and storing structured and unstructured data.
Optimize cloud-based data pipelines to support large-scale agent-based modeling simulations, including input/output data management, configuration tools like Hydra, and visualization in cloud environments.
Integrate data from APIs, flat files, databases, and other sources into a centralized infrastructure.
Implement data quality checks, monitoring systems, and validation tools to ensure accuracy and reliability.
Collaborate with modeling teams to support simulation and analytics workflows while also ensuring data availability and consistency.
Maintain documentation of data architecture, processes, and workflows.
Contribute to the continuous improvement of our data infrastructure and tooling.
Required Qualifications:
Bachelor’s degree (BA/BS or equivalent) in Computer Science, Engineering, Data Science, Information Systems, or a related field.
Must reside and be authorized to work in the United States.
Proficiency in Python, with experience using data-related libraries (e.g., pandas, SQLAlchemy, requests).
Proficiency in SQL, including experience with large-scale data querying and managing relational databases.
Experience building and maintaining production-grade ETL pipelines.
Experience working with the Google Cloud Platform (GCP) ecosystem (e.g., BigQuery, Cloud Storage, Cloud Functions, Pub/Sub).
Familiarity with Git and collaborative coding practices.
Strong analytical, quantitative, and critical thinking skills.
Ability to work independently, manage time effectively, and meet project deadlines.
Preferred Qualifications:
Experience in one or more of the following domains: energy markets, financial markets, or economic modeling.
Familiarity with data orchestration tools such as Airflow, Prefect, or Cloud Composer.
Familiarity with data warehousing solutions (e.g., Snowflake, BigQuery, Redshift).
Understanding of data governance, metadata management, and best practices for data quality and security.
Experience in complexity economics or agent-based modeling.
About the Company

Macrocosm
At Macrocosm, our mission is to help guide the world toward a more prosperous, stable, and sustainable economy. We aim to do this by accelerating the green energy transition.
We are doing this by building a digital twin of the entire global economy that allows us to simulate the impact of firms' decisions on their financial performance, and to test their strategies against new policies, technological change, competitive dynamics, etc. Unlike existing solutions, our products are both empirically validated with vast amounts of data and embedded with industry-specific domain knowledge.
Our current focus is on building an integrated decision-making platform for energy investors. We are hard at work building this platform, which combines insights from complexity economics, deep analytics, and ML to generate actionable insights for energy investors. Our hypothesis is that we can help decision-makers navigate and steer the energy transition by providing better foresight.