About Interests Education Experience Projects Publications

Venkatesh Avula

Data Scientist | GenAI & ML Specialist | Applied AI Research

Seasoned Data Scientist with 15+ years of experience driving real-world impact through AI. Currently lead the evaluation of GenAI products, ensuring safe, reliable, and scalable deployment of AI across the organization. Known for translating complex AI systems into actionable, high-value outcomes through strong cross-functional leadership.

Previously led end-to-end development of predictive models in healthcare and finance, enabling early disease detection, operational efficiency, and risk management at scale. Committed to advancing applied AI through rigorous evaluation, research, and collaboration — with multiple peer-reviewed publications and a passion for building AI that works in the real world.

Interests

  • Applied Machine Learning
  • Generative AI Application
  • Predictive Modeling at Scale
  • Real-World AI Integration
  • Data-Driven Decision Systems

Education

  • Master's in Management Information Systems, 2013–2014
    Oklahoma State University, Stillwater, OK
  • Bachelor's in Mechanical Engineering, 2005–2009
    JNTU College of Engineering, Anantapur, India

Experience

Principal Data Scientist
Elsevier, Philadelphia, PA
NOV 2024 – CURRENT
At Elsevier, I lead the end-to-end evaluation of GenAI products by designing and deploying scalable, high-precision human-in-the-loop and automated assessment pipelines. This includes building rule-based engines and leveraging advanced techniques like LLM-as-judge and machine learning classifiers to measure model performance across multiple critical dimensions. I drive cross-functional collaboration with product, engineering, and leadership teams to ensure these systems deliver fast, cost-effective, and reliable evaluation at scale. My work focuses on enabling safe, high-quality AI development through robust, continuously improving feedback loops and platformized evaluation frameworks.
Lead Data Scientist
Geisinger Medical Center, Danville, PA
FEB 2018 – NOV 2024
At Geisinger, I played a pivotal role at the intersection of healthcare and Artificial Intelligence (AI), unraveling intricate patterns within extensive datasets that included millions of electronic health records, imaging files, and textual notes. Adept at developing innovative solutions, I addressed data challenges such as missing values, skewed distributions, and imbalances while ensuring unbiased insights. My expertise lay in crafting state-of-the-art predictive models geared towards early detection of critical health conditions, significantly contributing to the evolution of precision health. Moreover, I optimized the emergency triage admissions workflow, collaborating seamlessly with interdisciplinary teams comprising physicians, research scientists, and stakeholders. Through partnerships with multiple health systems, I extended the generalizability of predictive models, operationalizing them within the healthcare system to establish a feedback loop for continuous improvement, effectively addressing model and concept drifts. Beyond model development, my commitment to advancing AI in healthcare was underscored by the publication of groundbreaking research in reputable healthcare journals, furthering the integration of AI in healthcare practices.
Data Scientist
Freddie Mac, McLean, VA
JUN 2016 – JAN 2018
At Freddie Mac, I optimized the origination cycle time by implementing Optical Character Recognition (OCR) on digital documents. Additionally, I validated and improved the internal costing model for projecting loan defaults. Through segmentation analysis, I assessed the impact of changes in Loan-to-Value (LTV) buckets on various affordable lending programs. Furthermore, I investigated the influence of socioeconomic indicators on mortgage lending and refined automated valuation models for accuracy.
Senior Quantitative Analyst
GE, Norwalk, CT
SEP 2014 – JUN 2016
At GE, I led a global team in building models to predict customer responses to campaigns. Identified key customer behavior patterns using advanced clustering and segmentation algorithms. Collaborated with marketing teams to make fact-based decisions and tracked campaign performance against ROI goals.
Data Analyst
IBM, Kolkata, India
JUN 2009 – JUL 2013
At IBM, I conducted statistical data analysis and automated monthly performance report generation. Managed data munging tasks and developed ad-hoc reports for quantitative analysis with descriptive statistics using SAS procedures.

Projects

GenAI Evaluation Platform Led the development of a modular evaluation platform for generative AI products, enabling both human-in-the-loop and automated assessments across dimensions such as safety, factuality, and helpfulness. Built reusable components including rule-based scoring engines, LLM-as-judge orchestration, ML-based evaluators, and batch inference pipelines. Designed the system to support test-time evaluation, post-hoc analysis, and large-scale backfills with fast turnaround and minimal operational cost. Integrated with product and engineering teams to drive scalable, reliable GenAI measurement.
Predictive Modeling for Early Detection Developed machine learning models using structured EHR data and clinical text to predict early onset of critical conditions. Collaborated with clinical teams to deploy models in real-world workflows, enabling timely interventions and creating feedback loops to monitor drift and maintain performance.
Emergency Triage Optimization Created risk stratification models to support emergency department triage and admissions. Improved prioritization for high-risk patients, reduced time-to-treatment, and optimized bed utilization through data-driven workflow integration.

Publications

Machine learning-based Cerebral Venous Thrombosis diagnosis with clinical data
Journal of Stroke and Cerebrovascular Diseases · 2024
Machine Learning-Based Prediction of Stroke in Emergency Departments
Therapeutic Advances in Neurological Disorders · 2024