# Data Science

- Do you ever subsample the data within a sample?
- For "Scale by Subset", how are the scaled values calculated?
- How do you handle batch effects?
- What is the minimum number of cells in a cluster?
- Which distance metric do you use for the hierarchical clustering?
- Which statistical test do you use for the differential abundance analysis?