Biostatistics - Exam
  • 1. Biostatistics is a branch of statistics that deals with data related to living organisms. It involves the design, analysis, and interpretation of data in fields such as biology, medicine, public health, and environmental science. Biostatistics plays a crucial role in research studies, clinical trials, and public health initiatives by providing statistical methods to analyze data, draw conclusions, and make informed decisions. It helps in understanding patterns of diseases, identifying risk factors, evaluating treatment interventions, and predicting health outcomes. Biostatisticians use their expertise in statistical theory and methods to address complex research questions and contribute to advancements in health science and policy.

    What is the purpose of hypothesis testing in biostatistics?
A) To determine if there is enough evidence to reject a null hypothesis.
B) To calculate standard deviation.
C) To prove a hypothesis with 100% certainty.
D) To estimate the population mean.
  • 2. In a clinical trial, what is the role of a control group?
A) To analyze the results.
B) To collect data from participants.
C) To provide a baseline for comparison to the treatment group.
D) To administer the treatment to participants.
  • 3. Which type of study design is best suited for determining cause and effect relationships?
A) Case-Control Study
B) Observational Study
C) Randomized Controlled Trial
D) Cross-Sectional Study
  • 4. Which statistical test can be used to compare more than two group means?
A) Chi-Square Test
B) Two-Sample t-test
C) Paired t-test
D) ANOVA
  • 5. What is the purpose of regression analysis?
A) To estimate population parameters.
B) To explore the relationship between a dependent variable and one or more independent variables.
C) To calculate probabilities.
D) To determine central tendency.
  • 6. Which type of sampling technique divides a population into subgroups and then samples each subgroup?
A) Cluster Sampling
B) Stratified Sampling
C) Systematic Sampling
D) Simple Random Sampling
  • 7. What does p-value indicate in hypothesis testing?
A) The confidence interval of the estimate.
B) The strength of the relationship between variables.
C) The probability of obtaining results as extreme as the observed results, assuming the null hypothesis is true.
D) The sample size required for the study.
  • 8. What is sensitivity in the context of diagnostic testing?
A) The proportion of false negative results.
B) The proportion of true positive results among all individuals with the condition.
C) The proportion of false positive results.
D) The proportion of true negative results among all individuals without the condition.
  • 9. What is biostatistics also referred to as?
A) Biomechanics
B) Bioinformatics
C) Biomathematics
D) Biometry
  • 10. Which field is closely related to medical statistics?
A) Pathology
B) Epidemiology
C) Pharmacology
D) Biostatistics
  • 11. Who started genetics studies by investigating segregation patterns in pea families?
A) Francis Galton
B) Charles Darwin
C) Gregor Mendel
D) William Bateson
  • 12. Who strongly disagreed with Galton's ideas on heredity?
A) William Bateson
B) Raphael Weldon
C) Karl Pearson
D) Arthur Dukinfield Darbishire
  • 13. Which group supported Mendel's ideas on genetic inheritance?
A) Darwinists
B) Neo-Darwinians
C) Biometricians
D) Mendelians
  • 14. Who developed the ANOVA and p-value concepts?
A) Betty Allan
B) Ronald Fisher
C) J. B. S. Haldane
D) Sewall G. Wright
  • 15. Who developed F-statistics and methods of computing them?
A) Betty Allan
B) J. B. S. Haldane
C) Ronald Fisher
D) Sewall G. Wright
  • 16. What did J. B. S. Haldane's book reestablish as the premier mechanism of evolution?
A) Natural selection
B) Gene flow
C) Mutation
D) Genetic drift
  • 17. Who banned the Friden calculator from his department at Caltech?
A) J. B. S. Haldane
B) Thomas Hunt Morgan
C) Ronald Fisher
D) Sewall G. Wright
  • 18. Which of the following is NOT a basic principle of experimental statistics?
A) Sample size determination
B) Randomization
C) Replication
D) Local control
  • 19. What should guide the formulation of a research question?
A) Cost considerations.
B) An exhaustive literature review.
C) The experimental design.
D) Data analysis perspectives.
  • 20. Which component of research planning involves defining how to ask a scientific question?
A) Data analysis perspectives.
B) Experimental design.
C) Costs involved.
D) The research question.
  • 21. Which principle of experimental statistics helps to eliminate bias?
A) Randomization
B) Cost estimation
C) Replication
D) Local control
  • 22. What is the first step in defining a research question according to the text?
A) Determining data collection methods.
B) Conducting an exhaustive literature review.
C) Outlining experimental design.
D) Estimating costs.
  • 23. In the formula for arithmetic mean, what does '∑' represent?
A) Summation
B) Difference
C) Product
D) Division
  • 24. Which cloud service provider is mentioned as a tool for statistical analysis in biological data?
A) Google Cloud Platform
B) Amazon Web Services
C) IBM Cloud
D) Microsoft Azure
  • 25. Which software is used for linear algebra computations?
A) NumPy
B) LAPACK
C) SciPy
D) SageMath
  • 26. What does a Pearson correlation coefficient value of -1 indicate?
A) A perfect negative correlation
B) No linear correlation
C) An undefined relationship
D) A perfect positive correlation
  • 27. Which database is dedicated to Arabidopsis thaliana?
A) TAIR
B) KEGG
C) Phytozome
D) dbSNP
  • 28. Which database stores assemblies and annotation files of dozens of plant genomes?
A) Phytozome
B) KEGG
C) dbSNP
D) TAIR
  • 29. What is another term for a scatter plot?
A) Line graph
B) Bar diagram
C) Pie chart
D) Scatter chart
  • 30. What is a scatter plot also known as?
A) Histogram
B) Scattergram
C) Pie chart
D) Bar chart
  • 31. Which distribution was initially used for RNA-Seq counts data but underestimated sample error?
A) Poisson
B) Normal
C) Negative Binomial
D) Binomial
  • 32. What major initiative relates data from DDBJ, EMBL-EBI, and NCBI?
A) International Nucleotide Sequence Database Collaboration (INSDC)
B) World Data Exchange Program
C) Bioinformatics Data Consortium
D) Global Genome Initiative
  • 33. What does a significance level (α) represent in hypothesis testing?
A) The acceptable error rate when deciding statistical significance
B) The probability that the null hypothesis is true
C) The correlation coefficient between two variables
D) The range of values for a confidence interval
  • 34. Which statistical models are used to perform tests for statistical significance in RNA-Seq data analysis?
A) Linear regression models
B) Generalized linear models
C) ANOVA
D) Chi-square tests
  • 35. Which type of graph is best suited for showing changes over time?
A) Pie chart
B) Bar chart
C) Histogram
D) Line graph
  • 36. What is a genome-wide association study (GWAS) based on?
A) Recombination frequency.
B) Quantitative trait loci.
C) Genomic selection.
D) Linkage disequilibrium.
  • 37. Which biostatistical method has gained popularity for statistical classification?
A) Random forests
B) Bootstrapping
C) Re-sampling methods
D) Decision trees
  • 38. What does marker-assisted selection aim to improve?
A) Clinical decision support systems.
B) Genomic selection models.
C) Quantitative trait mapping.
D) Breeding outcomes in agriculture.
  • 39. Which software package allows for variance component estimation under a general linear mixed model using REML?
A) ASReml
B) CycDesigN
C) Orange
D) SAS
  • 40. How does a well-defined research question benefit the scientific community?
A) By reducing the need for replication.
B) By minimizing costs.
C) By simplifying data analysis.
D) By adding value through novel insights.
  • 41. In a line graph, which axis typically represents time?
A) The vertical axis
B) Both axes equally represent time
C) The horizontal axis
D) Time is not represented in a line graph
  • 42. What is the formula for calculating the total number of observations (N) in a frequency table?
A) N = f1 + f2 + f3 + ... + fn
B) N = fi / N
C) N = fi * N
D) N = fi - N
  • 43. Which programming language is associated with deep-learning and image analysis in bioinformatics?
A) SQL
B) R
C) SAS
D) Python
  • 44. Which programming language is known for its open-source environment and statistical computing capabilities, with packages available on CRAN?
A) MATLAB
B) Python
C) R
D) SQL
  • 45. Which symbol represents the arithmetic mean in mathematical notation?
A) x̄
B) Σ
C) i
D) n
  • 46. What technique considers the perturbation of whole gene sets rather than single genes?
A) Next-generation sequencing
B) Linear discriminant analysis
C) Gene Set Enrichment Analysis (GSEA)
D) Principal component analysis
  • 47. Which aspect of research planning involves determining how to collect data?
A) Hypothesis testing.
B) Data collection methods.
C) Research question formulation.
D) Cost estimation.
  • 48. Which database is used for indexing scientific articles?
A) dbSNP
B) Gene Ontology
C) KEGG
D) PubMed
  • 49. What is the term for high intercorrelation between predictors in biostatistical settings?
A) Principal component analysis
B) Dimensionality reduction
C) Multicollinearity
D) Gene Set Enrichment Analysis
  • 50. Which software is a Java-based tool for machine learning and data mining?
A) R
B) SAS
C) Orange
D) Weka
  • 51. Which biostatistical method helps in reducing dimensionality by transforming predictors into a smaller set of uncorrelated components?
A) Linear regression
B) Principal component analysis
C) Logistic regression
D) Gene Set Enrichment Analysis
  • 52. Which software supports Quantitative Response Assays for regulated environments such as drug testing?
A) Apache Spark
B) PLA 3.0
C) SAS
D) Weka
  • 53. In which field is the design and analysis of clinical trials particularly important?
A) Quantitative genetics
B) Public health
C) Systems medicine
D) Animal breeding
  • 54. Which tool is used for high-level data processing, data mining, and visualization?
A) Orange
B) CycDesigN
C) PLA 3.0
D) ASReml
  • 55. Which mapping algorithm is not commonly used in QTL mapping?
A) Interval Mapping
B) Composite Interval Mapping
C) Multiple Interval Mapping
D) None of the above
  • 56. Which database focuses on SNPs?
A) Gene Ontology
B) dbSNP
C) PubMed
D) KEGG
  • 57. Who introduced histograms as a graphical representation?
A) Francis Galton
B) John Tukey
C) Ronald Fisher
D) Karl Pearson
Created with That Quiz — the math test generation site with resources for other subject areas.