Core Responsibilities
1. Leverages data pipeline designs and supports the development of data pipelines to support model development. Proficient with software tools that develop data pipelines in a distributed computing environment (PySprak, GlueETL).
2. Supports integration of model pipelines in a production environment. Develops understanding of SDLC for model production.
3. Reviews pipeline designs, makes data model design changes as needed. Documents and reviews design changes with data science teams.
4. Supports data discovery & automated ingestion for model development. Performs detailed analysis of raw data sources for data quality, applies business context, and model development needs.
5. Engages with internal stakeholders to understand and probe business processes in order to develop hypotheses. Brings structure to requests and translates requirements into an analytic approach. Participates in and influences ongoing business planning and departmental prioritization activities.
6. Runs model monitoring scripts, follows process for alerts to management as needed. Addresses issues found in data pipelines from model monitoring alerts.
7. Participates in special projects and performs other duties as assigned.
Qualifications
How We Work
Vanguard has implemented a hybrid working model for the majority of our crew members, designed to capture the benefits of enhanced flexibility while enabling in-person learning, collaboration, and connection. We believe our mission-driven and highly collaborative culture is a critical enabler to support long-term client outcomes and enrich the employee experience.