Full: Data Lab Reproducible Research Practices and Introduction to OpenScPCA Workshop, Philadelphia, May 14-15, 2024

April 5, 2024

Applications are open for the Data Lab's next workshop! We are holding a two-day course on Reproducible Research Practices and the Open Single-cell Pediatric Cancer Atlas (OpenScPCA) project from May 14-15, 2024. Please note that the OpenScPCA module is an optional part of the workshop.

The course begins with an introduction to principles and techniques to achieve reproducible results in computational cancer research. On day two, you can choose to continue the workshop and learn how to put your skills to use for OpenScPCA, our new pediatric cancer research project. We will teach you how to contribute and we'll get you completely set up so you can dive right into analysis as soon as the workshop ends! Learn more about the OpenScPCA project.

Some familiarity with basic coding concepts (e.g., defining variables, data structures, control flow structures) is expected.

Keep reading for more details about each day of the course and the OpenScPCA project. Information about applying and requesting travel reimbursement can be found below.

About the workshop

Day 1: Reproducible Research Practices 

Tuesday, May 14 from 9am-5pm Eastern time

Instructors will show you the fundamentals of commonly-used approaches in reproducibility that you can apply to increase the impact of your research by making your findings more robust and reliable.

We will cover some common practices for reproducible research, including:

  • Organizing your projects, including data, code, and documentation
  • Navigating your computer from the command line interface
  • Tracking and automating your work with scripts
  • Making your code more readable, robust, and reusable - by you and by others!
  • Maintaining and tracking changes in your projects and code over time with Git and GitHub
  • Managing and tracking software and package versions for improved reproducibility

We won’t have time to cover:

  • How to program in specific languages such as R or Python
  • All the features and foibles of Git and GitHub
  • Workflow management systems such as CWL, Snakemake, or Nextflow

Day 2 (Optional): Introduction to OpenScPCA

Wednesday, May 15 from 9am-5pm Eastern time

Learn how to contribute to an actual pediatric cancer research project! We will introduce you to OpenScPCA, an open, collaborative project to analyze single-cell data from over 50 pediatric cancer types. We encourage you to attend part two of the course if you are interested in contributing to OpenScPCA, or if you want to be a more effective collaborator for your own future projects! 

We will cover:

  • An introduction to OpenScPCA and ways to contribute
  • Data, tools, and software that are part of this project
  • How to get started, including onboarding and hands-on set up of your local environment
  • What to expect as a contributor, including:
    • Actions you'll take as a contributor
    • How you'll interact with the Data Lab project maintainers
    • The ins and outs of code review

Please note, you do not have to join OpenScPCA to attend or benefit from this part of the course. But after attending day two, you will have completed some onboarding steps, which will be helpful if you choose to contribute! If you can’t attend day two of the workshop, but are interested in OpenScPCA, we still want to hear from you.

Why join OpenScPCA?

This year, the Data Lab launched OpenScPCA to analyze and explore the single-cell and single-nuclei RNA-Sequencing data available on the ScPCA Portal. We are looking for researchers to join us, especially those with experience or expertise in pediatric cancer, single-cell data, labeling cell types or cell states, and pan-cancer analyses. 

Completing OpenScPCA will result in an openly licensed code base, knowledge that will live on through a peer-reviewed publication, and a resource that will benefit a broad community of pediatric cancer researchers! 

OpenScPCA contributors will:

  • Discover new datasets to advance their research
  • Meet new collaborators and join a supportive community 
  • Learn how to use powerful tooling for reproducible research and software development
  • Build their analysis portfolios and develop transferable skills
  • Possibly become eligible for a small one-time grant or be part of a future publication

Learn more during day two of this workshop!

Workshop details

The workshop will be held on May 14-15, 2024 from 9am-5pm Eastern time at One Bala Plaza, Bala Cynwyd, PA 19004 (just outside of Philadelphia). Each day will consist of lectures, hands on exercises, and time for open discussion with instructors and fellow participants. 

Participants should plan to bring their own laptop. We will provide breakfast, lunch, beverages, and snacks! Attendees will also be invited to join us for a group dinner.

Travel reimbursement

Travel reimbursement up to $500 is available for qualifying participants who reside over 50 miles from the workshop location. 

To qualify for reimbursement, you must:

  • Note this request on your application
  • Be able to provide documentation of your travel expenses
  • Be a childhood cancer researcher and attend at least the first day of the workshop
    • If you are a researcher who does not study childhood cancer, you can still request travel reimbursement. But you must attend both full days of the workshop to qualify.

Apply for the workshop!

If this sounds like it’s for you, please submit an application as soon as possible! Be sure to indicate whether you are applying for day one only or for the entire workshop.

Space is extremely limited and applications will be reviewed on a rolling basis. Applications will close when all spots are full, or on April 30, whichever comes first.

Accepted workshop participants will be asked to provide a $100 deposit to reserve their seat, whether or not they are attending both days of the course. Deposits will be fully refunded upon workshop attendance.

Please reach out to us at training@ccdatalab.org with any questions!

Applications are open for the Data Lab's next workshop! We are holding a two-day course on Reproducible Research Practices and the Open Single-cell Pediatric Cancer Atlas (OpenScPCA) project from May 14-15, 2024. Please note that the OpenScPCA module is an optional part of the workshop.

The course begins with an introduction to principles and techniques to achieve reproducible results in computational cancer research. On day two, you can choose to continue the workshop and learn how to put your skills to use for OpenScPCA, our new pediatric cancer research project. We will teach you how to contribute and we'll get you completely set up so you can dive right into analysis as soon as the workshop ends! Learn more about the OpenScPCA project.

Some familiarity with basic coding concepts (e.g., defining variables, data structures, control flow structures) is expected.

Keep reading for more details about each day of the course and the OpenScPCA project. Information about applying and requesting travel reimbursement can be found below.

About the workshop

Day 1: Reproducible Research Practices 

Tuesday, May 14 from 9am-5pm Eastern time

Instructors will show you the fundamentals of commonly-used approaches in reproducibility that you can apply to increase the impact of your research by making your findings more robust and reliable.

We will cover some common practices for reproducible research, including:

  • Organizing your projects, including data, code, and documentation
  • Navigating your computer from the command line interface
  • Tracking and automating your work with scripts
  • Making your code more readable, robust, and reusable - by you and by others!
  • Maintaining and tracking changes in your projects and code over time with Git and GitHub
  • Managing and tracking software and package versions for improved reproducibility

We won’t have time to cover:

  • How to program in specific languages such as R or Python
  • All the features and foibles of Git and GitHub
  • Workflow management systems such as CWL, Snakemake, or Nextflow

Day 2 (Optional): Introduction to OpenScPCA

Wednesday, May 15 from 9am-5pm Eastern time

Learn how to contribute to an actual pediatric cancer research project! We will introduce you to OpenScPCA, an open, collaborative project to analyze single-cell data from over 50 pediatric cancer types. We encourage you to attend part two of the course if you are interested in contributing to OpenScPCA, or if you want to be a more effective collaborator for your own future projects! 

We will cover:

  • An introduction to OpenScPCA and ways to contribute
  • Data, tools, and software that are part of this project
  • How to get started, including onboarding and hands-on set up of your local environment
  • What to expect as a contributor, including:
    • Actions you'll take as a contributor
    • How you'll interact with the Data Lab project maintainers
    • The ins and outs of code review

Please note, you do not have to join OpenScPCA to attend or benefit from this part of the course. But after attending day two, you will have completed some onboarding steps, which will be helpful if you choose to contribute! If you can’t attend day two of the workshop, but are interested in OpenScPCA, we still want to hear from you.

Why join OpenScPCA?

This year, the Data Lab launched OpenScPCA to analyze and explore the single-cell and single-nuclei RNA-Sequencing data available on the ScPCA Portal. We are looking for researchers to join us, especially those with experience or expertise in pediatric cancer, single-cell data, labeling cell types or cell states, and pan-cancer analyses. 

Completing OpenScPCA will result in an openly licensed code base, knowledge that will live on through a peer-reviewed publication, and a resource that will benefit a broad community of pediatric cancer researchers! 

OpenScPCA contributors will:

  • Discover new datasets to advance their research
  • Meet new collaborators and join a supportive community 
  • Learn how to use powerful tooling for reproducible research and software development
  • Build their analysis portfolios and develop transferable skills
  • Possibly become eligible for a small one-time grant or be part of a future publication

Learn more during day two of this workshop!

Workshop details

The workshop will be held on May 14-15, 2024 from 9am-5pm Eastern time at One Bala Plaza, Bala Cynwyd, PA 19004 (just outside of Philadelphia). Each day will consist of lectures, hands on exercises, and time for open discussion with instructors and fellow participants. 

Participants should plan to bring their own laptop. We will provide breakfast, lunch, beverages, and snacks! Attendees will also be invited to join us for a group dinner.

Travel reimbursement

Travel reimbursement up to $500 is available for qualifying participants who reside over 50 miles from the workshop location. 

To qualify for reimbursement, you must:

  • Note this request on your application
  • Be able to provide documentation of your travel expenses
  • Be a childhood cancer researcher and attend at least the first day of the workshop
    • If you are a researcher who does not study childhood cancer, you can still request travel reimbursement. But you must attend both full days of the workshop to qualify.

Apply for the workshop!

If this sounds like it’s for you, please submit an application as soon as possible! Be sure to indicate whether you are applying for day one only or for the entire workshop.

Space is extremely limited and applications will be reviewed on a rolling basis. Applications will close when all spots are full, or on April 30, whichever comes first.

Accepted workshop participants will be asked to provide a $100 deposit to reserve their seat, whether or not they are attending both days of the course. Deposits will be fully refunded upon workshop attendance.

Please reach out to us at training@ccdatalab.org with any questions!

Back To Blog