Classification Title:
Data Scientist II
Job Description:
The College of Medicine, Office of Research (OR) is seeking a Data Scientist II. This position will work with the Assistant Director of Research Data Management and Analytics to further build out the Research Analytics and Intelligence (rAI) unit within the office. The Data Scientist II will work within a multi-disciplinary team of AI, basic, and clinical researchers, IT and data analysts, medical sciences research administrators and coordinators, and other subject matter experts. The Data Scientist II will work with OR leadership to define priorities and develop project roadmaps focused on the College of Medicine research mission through the investigation and operationalization of strategic intelligence, foresight, and analytics for research and discovery. The candidate will support the OR leadership team on research portfolio analysis, research community and research topic analysis, faculty recruitment and retention support, and large grant development analytics. The candidate will be able to employ state-of-the-art machine learning (both supervised and unsupervised) and other statistical and data visualization techniques from big data mining and data science to produce these tools and reports.
The incumbent will possess the experience and expertise to conduct in-depth data mining and define detailed data exploration and analytical protocols, have a strong statistical background to generalize methodologies, and be proficient in the application of artificial intelligence, predictive modeling, and data visualization for business solutions and academic biomedical research and development.
If you want to be a part of a team that champions one another, dreams big, and aims high, we invite you to explore this opportunity further!
About This Role
Clinical Research Analytics and Report Creation
- Provide support for the College of Medicine’s Clinical Research Hub (CRH) mission through complex data-driven investigations with internal and external partners.
- Lead the planning and execution of data projects, ensuring alignment with business objectives by defining scopes, goals, and deliverables in collaboration with stakeholders. Translate business requirements into technical solutions by building quality-of-life software tools that automate current manual processes.
- Design, develop, and maintain robust APIs and data pipelines to automate data extraction, transformation, and loading (ETL). Collect, clean, and preprocess raw data from diverse sources to ensure high quality and readiness for analysis.
- Conduct in-depth data mining to evaluate clinical research metrics data, define detailed data exploration and analytical protocols, and apply a strong statistical background to generalize methodologies.
- Identify potential data problems in the requested data or analytic queries and take appropriate action to guide the resolution process.
- Design methodologies, document analyses, and translate findings into easily accessible reports/presentations and interactive visualizations.
- Build and maintain relational and distributed databases that support customer applications for the Clinical Research Hub. Utilize federated learning platforms to implement deep learning models.
- Process, integrate, filter, analyze, validate, and visualize multivariate and multimodal datasets for institutional reporting from local databases for the Clinical Research Hub.
- Ensure quality, consistency, and validity of large multimodal datasets for the Clinical Research Hub.
Research Intelligence
- Leverage advanced statistical modeling, artificial intelligence/machine learning, and big data to tailor accurate, precise, and meaningful intelligence solutions to administrators and faculty leaders.
- Collaborate with the College of Medicine’s Office of Research leadership to develop approaches for mapping the network of research interests and scientific investigations within the research enterprise and automate the identification of federal grant and legislative opportunities that align with those interests.
About COM's Office of Research
The College of Medicine's Office of Research collaborates closely with faculty, research leaders, and University administration to drive strategic initiatives that enhance research excellence and healthcare outcomes. The Office of Research focuses on diversifying its funding portfolio, interpreting sponsor guidelines, and facilitating collaborative, interdisciplinary research. This office provides vital administrative support to ensure compliance and promote research excellence across basic, translational, and clinical sciences.
For more information about the College of Medicine's Office of Research and its goals, visit College of Medicine Office of Research.
We Offer Exceptional Benefits
- Low-cost State Health Plans: Medical, Dental, and Vision Insurance
- Life and Disability Insurance
- Generous Retirement Options to Secure Your Future
- Comprehensive Paid Time Off Package (including 11 paid holidays, as well as paid family, sick, and vacation leave)
- Exceptional Personal and Professional Development Opportunities (UF Training & Organization Development including leadership development, LinkedIn Learning, amongst other opportunities)
- Tuition Assistance (UF Employee Education Program)
- Public Service Loan Forgiveness (PSLF) Eligible Employer
About the City of Gainesville
Discover Gainesville, Florida, home to the University of Florida College of Medicine, where modern attractions and natural beauty harmonize to create an exceptional living environment. Enjoy a low cost of living, no state income tax, outstanding public and private schools, and pleasant winters in a community that passionately supports Division I NCAA sports (Go Gators!). Explore scenic bike trails, lively farmers' markets, and a thriving local brewery scene. Immerse yourself in over 30 miles of biking and hiking trails, encounter diverse wildlife in Florida State Parks, and experience thrilling adventures in freshwater springs. Gainesville's central location offers easy access to both the Gulf of Mexico and the Atlantic Ocean, providing stunning beaches, nature preserves, and world-renowned theme parks within a day's drive. Become part of our vibrant community, where the perfect blend of opportunities awaits. Learn more about what Gainesville has to offer at Visit Gainesville.
Expected Salary:
$70,000 - $75,000
Minimum Requirements:
A Bachelor's Degree in data science, statistics, bioinformatics, analytics, or similar field and three years of experience; Master's Degree in data science, statistics, bioinformatics, analytics, or similar field and one year of experience; Doctoral Degree in data science, statistics, bioinformatics, analytics, or similar field.
Preferred Qualifications:
The ideal candidate will possess:
- Ph.D., M.S., or B.S. in Computer Science, Engineering, Math, Statistics, or a related field.
- Advanced proficiency in programming languages such as JavaScript, Python, and Java.
- Experience designing, building, and maintaining APIs and microservices using frameworks such as Flask and Django, with an understanding of REST architecture, the HTTP protocol, and OAuth.
- Familiarity with SQL and NoSQL databases and an understanding of CRUD operations, with experience in database management systems such as Oracle Database, MySQL, and MongoDB.
- Familiarity with tools and platforms like Docker, Kubernetes, TensorFlow Serving, and cloud services (e.g., AWS, Google Cloud, Azure).
- Basic software DevOps skills (Git, GitHub, version control, Postman, Swagger).
- Proven track record of successful ML implementations with experience developing and evaluating ML/DL models and summarizing and visualizing results.
- Experience in artificial intelligence, machine learning, and data engineering, with expertise in applying artificial intelligence, predictive modeling, and data visualization to business solutions and academic biomedical research and development.
- Familiarity with libraries such as PyTorch, TensorFlow, NLTK, spaCy, and Hugging Face Transformers.
- Understanding of NLP techniques and tools, including tokenization, part-of-speech tagging, named entity recognition, and sentiment analysis.
- Experience with neural networks and deep learning architectures such as CNNs, RNNs, LSTMs, GANs, and transformers.
- Ability to create insightful visualizations using tools like Matplotlib, Seaborn, Plotly, and Tableau.
- Knowledge of techniques for model tuning, hyperparameter optimization, and performance improvement.
- Strong interpersonal skills, can-do attitude, and ability to work both independently and as a team player in a fast-paced, cross-cultural environment.
- Tenacious, self-directed, and pragmatic with a growth mindset, always learning new technologies, and capable of creative problem-solving and effective communication under uncertainty.
Special Instructions to Applicants:
To be considered, please upload the following documents with your application:
- Cover letter
- Resume
- Contact information for three professional references
If an accommodation due to a disability is needed to apply for this position, please call 352-392-2477 or the Florida Relay System at 800-955-8771 (TDD).
Application must be submitted by 11:55 p.m. (ET) of the posting end date.
Health Assessment Required:
No