ManTech seeks a motivated, career and customer oriented Senior Data Scientist SME to join our team in Ashburn, Virginia. This position is onsite two days a week.
Each day U.S. Customs and Border Protection (CBP) oversees the massive flow of people, capital, and products that enter and depart the United States via air, land, sea, and cyberspace. The volume and complexity of both physical and virtual border crossings require the application of solutions to promote efficient trade and travel.
Responsibilities include but are not limited to:
Lead and perform hands-on data and threat/intel analysis leading to development of analytics solutions (e.g. predictive models, visual analytics reports), to support CBP users conduct law enforcement mission critical activities.
Demonstrate proficiency in extracting, cleaning, and transforming CBP transactional and associated data sets within an identified problem space to build predictive models as well as develop appropriate supporting documentation.
Leverage knowledge of a variety of statistical and machine learning techniques to develop, evaluate, and deploy new predictive analytical models that directly inform mission decisions.
Utilize and explore variety of statistical/modeling tools and languages to compare and assess best performing Machine Learning results.
Execute projects including those intended to identify patterns and/or anomalies in large datasets; perform automated text/data classification and categorization as well as entity recognition, resolution and extraction; and named entity matching.
Minimum Qualifications:
HS Diploma/GED and 20+ years or AS/AA and 18+ years or BS/BA and 12+ years or MS/MA/MBA and 9+ years or PhD/Doctorate and 7+ years
Experience in full-lifecycle development, deployment and monitoring of machine learning models to multiple platforms (on-prem/cloud etc.) and applying advanced analytics solutions to solve complex business problems
Experience with programming languages including R, Python, Scala, Java, SQL/Spark
Experience constructing and executing queries to extract data in support of EDA and model development
Experience with evaluating, implementing and optimizing AI/ML algorithms to address constraints with large and imbalanced datasets.
Experience with entity resolution (e.g., record linking, named entity matching, deduplication/ disambiguation)
Experience with unsupervised and supervised machine learning techniques and methods
Experience/Proficiency in conducting development and integration activities to deploy, assess and update AI/ML models into applications for end-user use and evaluation.
Preferred Qualifications:
Proficiency with Unsupervised Machine Learning methods including Cluster Analysis (e.g., K-means, K-nearest Neighbor, Hierarchical, Deep Belief Networks, Principal Component Analysis), Segmentation, etc.
Proficiency with Auto ML tools and platforms, such as AWS Sagemaker, DataRobot or DataBricks
Experience with big data technologies (e.g., Hadoop, HIVE, HDFS, HBase, MapReduce, Spark, Kafka, Sqoop)
Master’s Degree in mathematics, statistics, computer science/engineering, or other related technical fields with equivalent practical experience
Clearance Requirements:
Must be a US Citizen and able to obtain and maintain a U.S. Customs and Border Protection (CBP) suitability.
Must be eligible to obtain and maintain a Top Secret
Physical Requirements:
Must be able to be in a stationary position more than 50% of the time.
Must be able to communicate, converse, and exchange information with peers and senior personnel.
Constantly operates a computer and other office productivity machinery, such as a computer.