Data Scientist
Kavalirio
Job Summary
Kavaliro is seeking a Data Scientist to provide highly technical and in-depth data engineering support. The role requires experience designing and building data infrastructure, developing data pipelines, transforming and preparing data, ensuring data quality and security, and monitoring and optimizing systems. Extensive experience with Python and AWS is a must. The candidate will also work with SQL, various database technologies, NiFi, Git, Elasticsearch, Kibana, Jupyter Notebooks, NLP, AI, and data visualization tools.
Must Have
- Design and build data infrastructure
- Develop data pipelines
- Transform and prepare data
- Ensure data quality and security
- Monitor and optimize data systems
- Program with Python
- Build scalable ETL and ELT workflows for reporting and analytics
- Work with SQL and complex multi-data source queries
- Use code repositories such as Git
- Utilize Elastic and Kibana
- Apply machine learning techniques including natural language processing
- Hold a TS/SCI with Full Scope Polygraph clearance
- Be a permanent U.S. citizen
Good to Have
- Experience with cloud services like AWS
- Experience with cloud data technologies and architecture
- Experience using big data processing tools such as Apache Spark or Trino
- Experience with machine learning algorithms
- Experience with container frameworks such as Docker or Kubernetes
- Experience with data visualizations tools such as Tableau, Kibana or Apache Superset
- Experience creating learning objectives and teaching curriculum
Job Description
Job Description
---------------
Kavaliro is seeking a Data Scientist to provide highly technical and in-depth data engineering support. The candidate MUST have experience designing and building data infrastructure, developing data pipelines, transforming and preparing data, ensuring data quality and security, and monitoring and optimizing systems. The candidate MUST have extensive experience with Python and AWS. Experience with SQL, multi-data source queries with database technologies (PostgreSQL, MySQL, RDS, etc.), NiFi, Git, Elasticsearch, Kibana, Jupyter Notebooks, NLP, AI, and any data visualization tools (Tableau, Kibana, Qlik, etc.) are desired.
Required Skills and Demonstrated Experience
- Demonstrated experience with data engineering, to include designing and building data infrastructure, developing data pipelines, transforming/preparing data, ensuring data quality and security, and monitoring/optimizing systems.
- Demonstrated experience with data management and integration, including designing and perating robust data layers for application development across local and cloud or web data sources.
- Demonstrated work experience programming with Python
- Demonstrated experience building scalable ETL and ELT workflows for reporting and analytics.
- Demonstrated experience with general Linux computing and advanced bash scripting
- Demonstrated experience with SQL.
- Demonstrated experience constructing complex multi-data source queries with database technologies such as PostgreSQL, MySQL, Neo4J or RDS
- Demonstrated experience processing data sources containing structured or unstructured data
- Demonstrated experience developing data pipelines with NiFi to bring data into a central environment
- Demonstrated experience delivering results to stakeholders through written documentation and oral briefings
- Demonstrated experience using code repositories such as Git
- Demonstrated experience using Elastic and Kibana
- Demonstrated experience working with multiple stakeholders
- Demonstrated experience documenting such artifacts as code, Python packages and methodologies
- Demonstrated experience using Jupyter Notebooks
- Demonstrated experience with machine learning techniques including natural language processing
- Demonstrated experience explaining complex technical issues to more junior data scientists, in graphical, verbal, or written formats
- Demonstrated experience developing tested, reusable and reproducible work
- Work or educational background in one or more of the following areas: mathematics, statistics, hard sciences (e.g. Physics, Computational Biology, Astronomy, Neuroscience, etc.) computer science, data science, or business analytics
Desired Skills and Demonstrated Experience
- Demonstrated experience with cloud services, such as AWS, as well as cloud data technologies and architecture.
- Demonstrated experience using big data processing tools such as Apache Spark or Trino
- Demonstrated experience with machine learning algorithms
- Demonstrated experience with using container frameworks such as Docker or Kubernetes
- Demonstrated experience with using data visualizations tools such as Tableau, Kibana or Apache Superset
- Demonstrated experience creating learning objectives and creating teaching curriculum in technical or scientific fields
Location:
- McLean, Virginia
- This position is onsite and there is no remote availability.
Clearance:
- TS/SCI with Full Scope Polygraph
- Applicant MUST hold a permanent U.S. citizenship for this position in accordance with government contract requirements.
Kavaliro provides Equal Employment Opportunities to all employees and applicants. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Kavaliro is committed to the full inclusion of all qualified individuals. In keeping with our commitment, Kavaliro will take the steps to assure that people with disabilities are provided reasonable accommodations. Accordingly, if reasonable accommodation is required to fully participate in the job application or interview process, to perform the essential functions of the position, and/or to receive all other benefits and privileges of employment, please respond to this posting to connect with a company representative.
By using best practices and optimal employee recruiting strategies, Kavaliro provides employers with employment solutions by providing the most qualified and professional employees, who can staff both project and permanent positions in order to ensure the ongoing success of all types of businesses. We use a streamlined-yet-thorough approach to staffing that saves our clients administrative time, resources and money.