Extensive experience in Big Data space (Hadoop Stack like M/R, HDFS, Pig, Hive, HBase, Flume, Sqoop, NoSQL stores like Cassandra, HBase etc.) across Fractal and contributes to open-source Big Data technologies.
Write and tune complex Java, MapReduce, and Hive jobs.
Experience leading a Backend/Distributed Data Systems team while remaining hands-on is very important.
Manage the business intelligence team and vendor partners, ensuring to prioritize projects according to customer and internal needs, and develops top-quality dashboards using industry best practices.
Manage team of data engineers (both full-time associates and/or third-party resources)
Analyzes and confirms the integrity of source data to be evaluated.
Leads in deployment and auditing models and attributes for accuracy.
Experience with stream-processing systems: Spark-Streaming, Strom etc.
Experience with object-oriented/object function scripting languages: Python, Scala etc.
Experience in designing and building dimensional data models to improve accessibility, efficiency, and quality of data.
Should be proficient in writing Advanced SQLs, Expertise in performance tuning of SQLs. Experience with data science and machine learning tools and technologies is a plus.
Experience with relational SQL and NoSQL databases, including Postgres and Cassandra.