Job Summary
Bring a combination of mathematical rigor and innovative algorithm design to create recipes that extract relevant insights from billions of rows of data to meaningfully improve Comcast user experience.Job Description
Core Responsibilities
Selecting and transforming features, building and optimizing classifiers using machine learning techniques
Integrating data from multiple sources including third party sources.
Data mining using state-of-the-art methods
Enhancing data collection procedures to include information that is relevant for building analytic systems
Frequent meeting/communication with stakeholders to interpret their needs, plan/organize, and discuss progress and results
Developing actionable quantitative models in the areas of effectiveness, ROI, pricing and optimization.
Doing ad-hoc analysis and presenting results in a clear manner
Creating automated anomaly detection systems and constant tracking of its performance
Creating automated evaluation environment of complex models and constant tracking of relevant performance
Develop and communicate goals, strategies, tactics, project plans, timelines, and key performance metrics to reach goals
Here are some of the specific technologies we use:
Spark (AWS EMR, Databricks), AWS Lambda
Spark Streaming and Batch
Avro, Parquet
Stream Data Platforms: Kafka, AWS Kinesis
MySQL, Cassandra, HBase, MongoDB, RDBMS
Caching Frameworks(ElasticCache/Redis)
Elasticsearch, Beats, Logstash, Kibana
Java, Scala, Go, Python, R
Git, Maven, Gradle, Jenkins
Rancher, Puppet, Concourse, Docker, Ansible, Kubernetes
Linux
Hadoop (HDFS, YARN, ZooKeeper, Hive), Presto, Athena
Keras, TensorFlow, Scikit.learn, Pandas)
Visualization suite (AWS Quicksight, Grafana)
Skills & Requirements:
Graduate degree or Phd in the following areas: Statistics, Data Science, Computer Science or relevant science or engineering discipline.
1+ years working within an enterprise data lake/warehouse environment or big data architecture
Understanding of machine learning techniques and algorithms, especially in the deep learning area -- both theoretical underpinnings and craft (Systems such as Tensorflow, Theano, Caffe, scikit.learn and their APIs).
Applied statistics skills and understanding of probability distributions, statistical testing, regression, etc.
Experience with common data science toolkits, such as scikit-learn, R, etc. Excellence in at least one of these is highly desirable.
Great communication skills.
Experience with data visualization tools, such as D3.js, GGplot, Matplotlib, etc.
Proficiency in using query languages such as SQL and Hive.
Experience with NoSQL databases, such as MongoDB, Redis/ElasticCache, Cassandra, HBase
Good scripting and programming skills, such as Java, Scala, R, Python, or Spark
Data-oriented personality
Disclaimer:
Skills
Data Science, Decision Making, Machine Learning, Problem Solving, Python (Programming Language)We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That's why we provide an array of options, expert guidance and always-on tools that are personalized to meet the needs of your reality—to help support you physically, financially and emotionally through the big milestones and in your everyday life.
Please visit the benefits summary on our careers site for more details.
Education
Bachelor's DegreeWhile possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.Certifications (if applicable)
Relevant Work Experience
7-10 YearsComcast is proud to be an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.PA - Philadelphia, 1800 Arch St, United States