Data Engineer SME
Anavation
Job Summary
AnaVation is seeking a Senior-level Data Engineer to develop and implement advanced data pipelines and ETL processes for classified environments. This role involves facilitating bulk analysis of relational information, integrating diverse intelligence data sources, and building applications that deliver actionable intelligence. Responsibilities include designing enterprise database systems, optimizing large relational databases, managing Elasticsearch/Opensearch clusters, and implementing CI/CD pipelines, while providing technical leadership.
Must Have
- Bachelor's degree in Computer Science, Data Science, Engineering, or related field
- Minimum of 10 years of experience in data engineering
- Active Top Secret (TS) clearance with SCI eligibility
- Experience with SAFe Agile framework
- Strong understanding of forensic and investigative data requirements
- Demonstrated experience designing and implementing data solutions in secure government environments
- Advanced proficiency with Python for data processing and ETL
- Advanced proficiency with SQL and query optimization
- Advanced proficiency with Elasticsearch or Opensearch
- Advanced proficiency with Data pipeline technologies (Apache Nifi, Cribl)
- Advanced proficiency with cross-domain ETL solutions
- Advanced proficiency with Docker and Kubernetes
- Advanced proficiency with Cloud platforms (AWS GovCloud, SC2S, C2S)
- Experience handling data in accordance with IC security protocols
- IC Data Services frameworks and integration protocols
- Advanced proficiency with GraphQL
- Advanced proficiency with DevSecOps practices and tools
- Advanced proficiency with RabbitMQ and Redis
Good to Have
- Experience with Intelligence Community Data Services and standards
- Prior work with cross-domain solutions for ETL processes
- Knowledge of IC specific data formats, schemas and transfer protocols
- Experience with specialized IC analytics platforms and data repositories
- Prior work integrating intelligence data across multiple enclaves
- Cloud certification
Perks & Benefits
- Generous cost sharing for medical insurance for the employee and dependents
- 100% company paid dental insurance for employees and dependents
- 100% company paid long-term and short term disability insurance
- 100% company paid vision insurance for employees and dependents
- 401k plan with generous match and 100% immediate vesting
- Competitive Pay
- Generous paid leave and holiday package
- Tuition and training reimbursement
- Life and AD&D Insurance
Job Description
Be Challenged and Make a Difference
In a world of technology, people make the difference. We believe if we invest in great people, then great things will happen. At AnaVation, we provide unmatched value to our customers and employees through innovative solutions and an engaging culture.
Description of Task to be Performed:
AnaVation is seeking a highly skilled Senior-level Data Engineer to join our team deliver engineering tasks to advanced analytic applications across classified environments. You will develop sophisticated data pipelines and ETL processes for desktop and web-based analytic software, facilitate the bulk analysis of relational information, and enable applications that deliver data from diverse data sources.
Key Responsibilities:
- Design and implement complex data pipelines and ETL processes to support cyber investigative capabilities across multi-classification domains
- Architect and develop ETL workflows for highly sensitive data in classified environments
- Support IC Data Services requirements by integrating various intelligence data sources and systems
- Develop and maintain data analytics solutions for desktop and web-based visual analytic applications
- Establish applications that produce manageable, actionable intelligence from streams of structured and semi-structured data
- Design strategies for enterprise database systems and set standards for operations, programming, and security
- Construct and optimize large relational databases across multi-enclave environments (Unclassified, Secret, and Top Secret)
- Tune performance of large-scale data workflows, ensuring cost efficiency, low latency, and high availability
- Design and manage Elasticsearch/Opensearch clusters for fast search, indexing, and retrieval of large-scale datasets
- Integrate new systems with existing warehouse structures and refine system performance and functionality
- Implement CI/CD pipelines for data systems, automate monitoring/alerting, and enforce infrastructure-as-code practices
- Provide technical leadership and mentorship to other team members
- Participate in Program Increments (PIs) and Agile Release Train (ART) activities
Required Qualifications:
- Bachelor's degree in Computer Science, Data Science, Engineering, or related field
- Minimum of 10 years of experience in data engineering or related field
- Active Top Secret (TS) clearance with eligibility for Sensitive Compartmented Information (SCI)
- Experience with SAFe Agile framework
- Strong understanding of forensic and investigative data requirements
- Demonstrated experience designing and implementing data solutions in secure government environments
Advanced proficiency with
- Python for data processing, automation and ETL workflow orchestration
- SQL (MySQL, PostgreSQL, Microsoft SQL) and query optimization
- Elasticsearch or Opensearch (design, scaling, query optimization, cluster management)
- Data pipeline technologies (Apache Nifi, Cribl)
- Cross-domain ETL solutions for secure data transfer between classification levels
- Containerization and orchestration technologies (Docker, Kubernetes)
- Cloud platforms (AWS GovCloud, SC2S, C2S)
- Experience handling data in accordance with IC security protocols and classification guidelines
- IC Data Services frameworks and integration protocols
- GraphQL: schema design, API development, query optimization, and integrations
- DevSecOps practices and tools in classified environments
- RabbitMQ and Redis
Work Environment
- Primary location: Chantilly, VA
- May require occasional travel for Program Increment planning sessions
- Must be a U.S. citizen and able to pass a background check and polygraph examination
- May require flexible scheduling to support critical operations
Preferred Qualifications:
- Experience with Intelligence Community Data Services and standards
- Prior work with cross-domain solutions for ETL processes
- Knowledge of IC specific data formats, schemas and transfer protocols
- Experience with specialized IC analytics platforms and data repositories
- Prior work integrating intelligence data across multiple enclaves
- Cloud certification
Benefits
- Generous cost sharing for medical insurance for the employee and dependents
- 100% company paid dental insurance for employees and dependents
- 100% company paid long-term and short term disability insurance
- 100% company paid vision insurance for employees and dependents
- 401k plan with generous match and 100% immediate vesting
- Competitive Pay
- Generous paid leave and holiday package
- Tuition and training reimbursement
- Life and AD&D Insurance
About AnaVation
AnaVation is the leader in solving the most complex technical challenges for collection and processing in the U.S. Federal Intelligence Community. We are a US owned company headquartered in Chantilly, Virginia. We deliver groundbreaking research with advanced software and systems engineering that provides an information advantage to contribute to the mission and operational success of our customers. We offer complex challenges, a top-notch work environment, and a world-class, collaborative team.
If you want to grow your career and make a difference while doing it, AnaVation is the perfect fit for you!
AnaVation is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.