Careers/Data Acquisition Engineer

Data Acquisition Engineer

We are looking for a Data Acquisition Engineer to lead the ingestion, parsing, cleaning, and structuring of large external datasets used across our financial data platform. You will work with a wide range of inputs and turn them into clean, well-organized, analysis-ready assets.

Apply Now

About the Role

We are looking for a Data Acquisition Engineer to lead the ingestion, parsing, cleaning, and structuring of large external datasets used across our financial data platform. You will work with a wide range of inputs—including public datasets, financial reference data, and alternative data—and turn them into clean, well-organized, analysis-ready assets.

You will take datasets that have been identified for integration and handle the full technical onboarding lifecycle: ingesting raw data, designing parsing logic, developing cleaning and normalization workflows, implementing validation checks, and producing high-quality structured outputs. If you enjoy bringing order to messy, heterogeneous datasets and building reliable ingestion pipelines that support financial research and analytics, this role is for you.

Responsibilities

  • Own the technical onboarding of external datasets, including parsing raw files, transforming fields, and producing clean structured outputs.
  • Write parsing and transformation logic in Python and SQL to handle diverse file formats (CSV, JSON, XML, HTML, XBRL, PDF, etc.).
  • Develop reproducible ETL/ELT workflows that clean, normalize, validate, and structure incoming datasets.
  • Manage data storage and processing workflows using S3-compatible object storage systems.
  • Produce efficient, analytics-ready Parquet datasets, using appropriate partitioning and metadata conventions.
  • Implement data-quality checks to detect anomalies, schema drift, missing fields, or unexpected changes in incoming data.
  • Troubleshoot and resolve inconsistencies through systematic, transparent cleaning and transformation rules.
  • Collaborate with internal data and research teams to understand dataset characteristics, quirks, semantics, and intended uses.
  • Provide light technical input during dataset evaluation, offering insight into ingest feasibility and transformation complexity.
  • Write clear documentation describing dataset structure, parsing assumptions, transformation logic, and known limitations.

Skills & Qualifications

  • Direct experience working with financial datasets at a hedge fund, financial institution, or major financial data provider.
  • Proven track record onboarding or structuring large, complex financial or finance-adjacent datasets, such as market data, fundamentals, regulatory data, reference data, or alternative data.
  • Strong proficiency in Python for parsing, cleaning, and transformation workflows.
  • Strong SQL skills for exploration, validation, and modeling.
  • Hands-on experience working with S3-compatible object storage for large dataset management.
  • Proficiency with Parquet and other columnar storage formats, including partitioning strategies for performance and scale.
  • Experience designing ETL/ELT workflows that are reproducible, maintainable, and resilient to upstream dataset changes.
  • Ability to interpret messy or loosely documented datasets and design stable parsing logic.
  • Clear written communication skills for documenting processes, assumptions, and dataset behavior.
  • Experience with DataFusion, DuckDB, or other modern analytical engines is a plus.
  • Exposure to datasets such as regulatory filings, financial reference data, or alternative data is a plus.
  • Experience extracting structured information from PDFs or other irregular data sources is a plus.
  • Familiarity with schema validation, metadata management, or data-quality frameworks is a plus.

About Massive

At Massive we are on a mission to help developers build the future of fintech. We are committed to democratizing access to the world's financial market data and enabling developers to build the future of fintech. Join us and be part of a team that is revolutionizing the way we interact with money and value.

If you are a passionate problem-solver who thrives in a dynamic and innovative environment, we encourage you to apply!

Apply now

Please fill out the form or email your resume to careers@massive.com.

U.S. Equal Opportunity Employer

Massive is an equal opportunity employer and complies with all applicable federal, state, and local fair employment practices laws. We strictly prohibit and do not tolerate harassment or discrimination against employees, applicants, or any other covered persons because of race, color, religion, creed, national origin or ancestry, ethnicity, sex, gender, gender identity, age, physical or mental disability, citizenship, sexual orientation, past, current or prospective service in the uniformed services. To request a reasonable accommodation, please email careers@massive.com.

Benefits for full time offers from Massive include, but are not limited to, comprehensive medical plans, 401(k), and unlimited time off. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

Before submitting your application, please take a minute to review our Applicant Privacy Notice, which describes the data we collect, why we collect it, and how we use it. By submitting your application, you consent to our Applicant Privacy Notice.

Modernizing Wall St.

Reimagining financial market data for the 21st century.

Schedule a demo

Talk with our market data experts