Biomanufacturing Consortium for Analytics and Data Engineering (BioCAD) Expansion

BioCAD Expansion is a collaborative technical training program replication and expansion effort centered on the West Coast. It will develop and offer mezzanine-level training modules.
Categories
Incumbent worker training

Industry Need

The biopharmaceutical industry is experiencing a rapid shift toward digital transformation, requiring employees to apply data analytics to optimize biomanufacturing processes, improve efficiency, and ensure quality. However, most incumbent workers and emerging talent lack practical training in programming languages like Python and in applying analytics to real-world bioprocessing contexts. Existing programs are often costly, highly specialized, or inaccessible to non-data-science professionals, creating a critical skills gap that limits the adoption of data-driven decision-making across manufacturing, operations, and supply chain functions.

To remain competitive and meet growing demand for biologics, the industry needs scalable, flexible training solutions that introduce foundational data programming and analytics concepts in biomanufacturing applications. Programs must be affordable, asynchronous, and adaptable for diverse learners—from community college students to incumbent employees—while providing hands-on practice with real datasets and industry-relevant case studies. Addressing this need will enable companies to build a data-enabled workforce capable of leveraging analytics for process optimization, risk reduction, and operational excellence.

Approach

The Biomanufacturing Consortium for Analytics and Data engineering (BioCAD) Expansion is a collaborative technical training program replication and expansion effort centered on the West Coast. The proposed 18-month project recruits additional team members and builds upon a NIIMBL-funded workforce training pilot launched in 2020. 

Growing from 14 hours of existing introductory content, the BioCAD Expansion will: 

  • Develop and offer mezzanine-level training modules—between professional development and academic offerings—targeting scientists, manufacturing engineers, process development engineers, and operations specialists (not trained data scientists).  
  • Package modules for asynchronous and hybrid delivery and integration into different employee learning contexts, focusing on active-learning strategies that empower participants to apply knowledge and practice data programming skills on realistic problems.  
  • Scaffold 26 additional hours of primer, introductory, and intermediate training content in bioprocessing, operations, and supply chain analytics. 
  • Test training modules by approximately 150 incumbent workers and collegiate students in multiple corporate and academic contexts.  

Impacts

Creation and development of mezzanine-level training modules between professional development and academic offerings—targeting scientists, manufacturing engineers, process development engineers, and operations specialists.

Designed, tested, and delivered a virtual asynchronous training program in data programming for biopharmaceutical manufacturing, inclusive of four introductory and intermediate courses.

Introduced the analysis of upstream and downstream processes as well as business operations and supply chain topics. It is targeted to industry professionals in non-data-science roles across bioprocess development, manufacturing engineering, scientific, and business operations roles.

Also the first program of its kind that teaches Python specifically for a biomanufacturing or bioprocessing context outside of university courses.

Piloted via a low-cost online learning platform with more than 70 collegiate and incumbent worker participants enrolling in the program (a public launch is planned in late 2023 with an additional four hours of recorded expert panels, statistical analysis tutorials, and introductory biomanufacturing content).

Value Statement/Outcomes

The BioCAD Expansion project delivers transformative value by creating the first scalable, asynchronous training program that teaches Python and data analytics specifically for biomanufacturing applications. This initiative accelerates digital transformation and expands workforce equity by providing high-quality, application-focused training at a fraction of the cost of traditional programs. BioCAD equips engineers, scientists, and operations managers with practical skills to analyze upstream and downstream processes, business operations, and supply chain data—strengthening the industry’s ability to meet evolving challenges.

Key Outcomes

  • Developed and launched four introductory and intermediate courses with 25+ hours of modularized content, including video tutorials, datasets, and industry expert panels.
  • Piloted the program with 70+ participants and hosted in-person boot camps for students and instructors at community college and high school levels.
  • Secured a $299,000 foundation grant to expand BioCAD for collegiate pipelines, targeting 85 students and 10–20 instructors in upcoming cohorts.


Outputs/Deliverables

Produced ~25 hours of modularized, professionally recorded video tutorials, along with datasets, data programming practice problems, and industry expert panels/spotlight videos.

Piloted in-person boot camps using course content for community college, undergraduate, and high school students and instructors.

Leveraged BioCAD course content to host a Python training workshop for high school instructors from Alameda School District.

Developed content to support integration of data analytics and programming into biology and computer science curricula.

Created a foundation for scalable, blended learning experiences combining digital resources with hands-on training.

Publications

Fu, Q., Lee, Y. S., Green, E. A., Wang, Y., Park, S. Y., Polanco, A., Lee, K. H., Betenbaugh, M., McNally, D., & Yoon, S. (2023). Design space determination to optimize DNA complexation and full capsid formation in transient rAAV manufacturing. Biotechnology and Bioengineering, 120(11), 3148-3162. https://doi.org/10.1002/bit.28508

Wang, Y., Fu, Q., Lee, Y. S., Sha, S., & Yoon, S. (2023). Transcriptomic features reveal molecular signatures associated with recombinant adeno-associated virus production in HEK293 cells. Biotechnology Progress, 39(4). https://doi.org/10.1002/btpr.3346

Presentations

Biomanufacturing Consortium for Analytics and Data Engineering (BioCAD) Expansion (PC4.1-214), NIIMBL Member Forum, Virtual, October 27, 2022.

Additional Project Information (Members Only)

Login to the NIIMBL member portal to access additional project information, including presentations, progress updates, reports, and more.

Not yet a member? Learn more about which level of NIIMBL membership is right for you and your organization.

Project Lead

Santa Clara University

Santa Clara University

Participating Organizations

Genentech, Inc.

Genentech, Inc.

Keck Graduate Institute

Keck Graduate Institute

Life Science Washington

Life Science Washington

Oregon Life Sciences

Oregon Life Sciences

Pfizer, Inc.

Pfizer, Inc.