• Location: Chantilly, Virginia
  • Type: Contract
  • Job #2281
Senior Data Engineer
Clearance:
TS/SCI with CI Poly
Chantilly, VA
 
We are looking for team-members with creative talent who are ready to take on the challenge of Senior Data Engineer to work collaboratively in order to further advance cutting-edge technology, products, and services for a large Government agency to detect and mitigate insider threats.
 
The Insider Threat Detection program is manned by a multi-disciplinary team of system and data engineers, data scientists, software developers, intelligence analysts, and investigators to provide insider threat detection and counterintelligence services. This program provides an opportunity to further advance cutting-edge technology, products, and services for a large Government agency to detect and mitigate insider threats. This program takes data from multiple sources in any format (structured or unstructured), transforms it into interpretable fragments, and allow our engines to categorize, quantify, distill, and display results for human analysts to interpret. Team members work closely with esteemed customers to develop solutions that allow them to carry out high-stakes national security missions. The technology stack is built on cutting edge hardware and software with multiple Windows and Linux environments interfacing with multi-petabyte data processing and analytic platforms – all designed, built, and maintained by our team.
 
As a Senior Data Engineer, you will help us expand our insider threat capabilities in automating data integration and collection strategies. The successful candidate will help expand and optimize the data ingestion pipeline architecture, develop strategies for efficient ingestion, processing, storage, structuring, and access. In addition, the Data Engineer will support data analysts, data scientists, and big data engineers in identifying data sources, performing exploratory data analysis, developing data models, ensuring data cleanliness and accuracy to provide new Insider Threat behavioral insights.
 
The field of insider threat detection and mitigation is evolving and growing, and our program needs highly innovative individuals, please keep reading…
 
Roles and responsibilities potentially include:
  • Support data science team by designing, developing and implementing scalable ETL process for disparate datasets into a Hadoop infrastructure
  • Design, develop, implement, and maintain data ingestion process from various disparate datasets using StreamSets (experience with StreamSets not mandatory)
  • Develop processes to identify data drift and malformed records
  • Develop technical documentation and standard operating procedures
  • Leads technical tasks for small teams or projects
 
Required Experience:
  • Working knowledge of entity resolution systems
  • Experience with Hadoop and Hive/Impala
  • Experience with messages systems like Kafka
  • Experience with NoSQL and/or graph databases like MongoDB or ArangoDB
  • Any of the following databases: SQL, MongoDB, Oracle, Postgres
  • Working experience with ETL processing and Python
  • Working experience with data workflow products like StreamSets or NiFi
  • Working experience with Python RESTful API services, JDBC
  • Experience with Cloudera Data Science Workbench is a plus
  • Understanding of pySpark
  • Leadership experience
  • Creative thinker
  • Ability to multi-task
  • Excellent use and understanding of data engineering concepts, principles, and theories.
Required Qualifications:
  • Bachelors of Science in a STEM (Science, Technology, Engineering, Mathematics) related field, plus 8 yrs or Masters degree plus 6 yrs.
Attach a resume file. Accepted file types are DOC, DOCX, PDF, HTML, and TXT.

We are uploading your application. It may take a few moments to read your resume. Please wait!