The biopharmaceutical industry is experiencing a rapid shift toward digital transformation, requiring employees to apply data analytics to optimize biomanufacturing processes, improve efficiency, and ensure quality. However, most incumbent workers and emerging talent lack practical training in programming languages like Python and in applying analytics to real-world bioprocessing contexts. Existing programs are often costly, highly specialized, or inaccessible to non-data-science professionals, creating a critical skills gap that limits the adoption of data-driven decision-making across manufacturing, operations, and supply chain functions.
To remain competitive and meet growing demand for biologics, the industry needs scalable, flexible training solutions that introduce foundational data programming and analytics concepts in biomanufacturing applications. Programs must be affordable, asynchronous, and adaptable for diverse learners—from community college students to incumbent employees—while providing hands-on practice with real datasets and industry-relevant case studies. Addressing this need will enable companies to build a data-enabled workforce capable of leveraging analytics for process optimization, risk reduction, and operational excellence.
The Biomanufacturing Consortium for Analytics and Data engineering (BioCAD) Expansion is a collaborative technical training program replication and expansion effort centered on the West Coast. The proposed 18-month project recruits additional team members and builds upon a NIIMBL-funded workforce training pilot launched in 2020.
Growing from 14 hours of existing introductory content, the BioCAD Expansion will:
Creation and development of mezzanine-level training modules between professional development and academic offerings—targeting scientists, manufacturing engineers, process development engineers, and operations specialists.
Designed, tested, and delivered a virtual asynchronous training program in data programming for biopharmaceutical manufacturing, inclusive of four introductory and intermediate courses.
Introduced the analysis of upstream and downstream processes as well as business operations and supply chain topics. It is targeted to industry professionals in non-data-science roles across bioprocess development, manufacturing engineering, scientific, and business operations roles.
Also the first program of its kind that teaches Python specifically for a biomanufacturing or bioprocessing context outside of university courses.
Piloted via a low-cost online learning platform with more than 70 collegiate and incumbent worker participants enrolling in the program (a public launch is planned in late 2023 with an additional four hours of recorded expert panels, statistical analysis tutorials, and introductory biomanufacturing content).
The BioCAD Expansion project delivers transformative value by creating the first scalable, asynchronous training program that teaches Python and data analytics specifically for biomanufacturing applications. This initiative accelerates digital transformation and expands workforce equity by providing high-quality, application-focused training at a fraction of the cost of traditional programs. BioCAD equips engineers, scientists, and operations managers with practical skills to analyze upstream and downstream processes, business operations, and supply chain data—strengthening the industry’s ability to meet evolving challenges.
Key Outcomes
Produced ~25 hours of modularized, professionally recorded video tutorials, along with datasets, data programming practice problems, and industry expert panels/spotlight videos.
Piloted in-person boot camps using course content for community college, undergraduate, and high school students and instructors.
Leveraged BioCAD course content to host a Python training workshop for high school instructors from Alameda School District.
Developed content to support integration of data analytics and programming into biology and computer science curricula.
Created a foundation for scalable, blended learning experiences combining digital resources with hands-on training.
Fu, Q., Lee, Y. S., Green, E. A., Wang, Y., Park, S. Y., Polanco, A., Lee, K. H., Betenbaugh, M., McNally, D., & Yoon, S. (2023). Design space determination to optimize DNA complexation and full capsid formation in transient rAAV manufacturing. Biotechnology and Bioengineering, 120(11), 3148-3162. https://doi.org/10.1002/bit.28508
Wang, Y., Fu, Q., Lee, Y. S., Sha, S., & Yoon, S. (2023). Transcriptomic features reveal molecular signatures associated with recombinant adeno-associated virus production in HEK293 cells. Biotechnology Progress, 39(4). https://doi.org/10.1002/btpr.3346
Biomanufacturing Consortium for Analytics and Data Engineering (BioCAD) Expansion (PC4.1-214), NIIMBL Member Forum, Virtual, October 27, 2022.
Login to the NIIMBL member portal to access additional project information, including presentations, progress updates, reports, and more.
Not yet a member? Learn more about which level of NIIMBL membership is right for you and your organization.

Santa Clara University
Genentech, Inc.
Keck Graduate Institute
Life Science Washington
Oregon Life Sciences
Pfizer, Inc.