Biostatistics - Exam
Contributed by: Burrows
  • 1. Biostatistics is a branch of statistics that deals with data related to living organisms. It involves the design, analysis, and interpretation of data in fields such as biology, medicine, public health, and environmental science. Biostatistics plays a crucial role in research studies, clinical trials, and public health initiatives by providing statistical methods to analyze data, draw conclusions, and make informed decisions. It helps in understanding patterns of diseases, identifying risk factors, evaluating treatment interventions, and predicting health outcomes. Biostatisticians use their expertise in statistical theory and methods to address complex research questions and contribute to advancements in health science and policy.

    What is the purpose of hypothesis testing in biostatistics?
A) To estimate the population mean.
B) To determine if there is enough evidence to reject a null hypothesis.
C) To calculate standard deviation.
D) To prove a hypothesis with 100% certainty.
  • 2. In a clinical trial, what is the role of a control group?
A) To analyze the results.
B) To administer the treatment to participants.
C) To provide a baseline for comparison to the treatment group.
D) To collect data from participants.
  • 3. Which type of study design is best suited for determining cause-and-effect relationships?
A) Randomized Controlled Trial
B) Case-Control Study
C) Observational Study
D) Cross-Sectional Study
  • 4. Which statistical test can be used to compare more than two group means?
A) Chi-Square Test
B) Paired t-test
C) Two-Sample t-test
D) ANOVA
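
As an illustration for question 4, the sketch below computes a one-way F statistic by hand: the ratio of between-group to within-group variance across several group means. It uses only the Python standard library. The three groups are made-up values, and the function name `one_way_anova_f` is hypothetical rather than taken from any package.

```python
def one_way_anova_f(groups):
    """Return the one-way F statistic for a list of numeric groups."""
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)

    # Between-group sum of squares: how far each group mean sits from the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of observations around their own group mean.
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)


# Made-up measurements for three groups (illustrative only).
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
print(round(f_stat, 4))
```

For these values the between-group mean square is 13 and the within-group mean square is 1, so F is 13. Turning F into a p-value requires the F distribution, which is left out of this sketch.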
  • 5. What is the purpose of regression analysis?
A) To determine central tendency.
B) To explore the relationship between a dependent variable and one or more independent variables.
C) To calculate probabilities.
D) To estimate population parameters.
  • 6. Which type of sampling technique divides a population into subgroups and then samples each subgroup?
A) Cluster Sampling
B) Stratified Sampling
C) Simple Random Sampling
D) Systematic Sampling
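
As an illustration for question 6, here is a minimal sketch of drawing a sample separately from each subgroup of a population. The strata names, participant IDs, and the helper `stratified_sample` are all hypothetical, and a fixed seed is used only to make the example repeatable.

```python
import random


def stratified_sample(strata, per_stratum, seed=0):
    """Draw `per_stratum` members at random, without replacement, from every stratum."""
    rng = random.Random(seed)
    return {name: rng.sample(members, per_stratum) for name, members in strata.items()}


# Hypothetical population, already divided into age strata.
strata = {
    "age_18_39": ["p01", "p02", "p03", "p04"],
    "age_40_64": ["p05", "p06", "p07"],
    "age_65_plus": ["p08", "p09", "p10", "p11", "p12"],
}
sample = stratified_sample(strata, per_stratum=2)
print(sample)
```

Every subgroup contributes to the sample, which is the point of the design. A real study would often sample in proportion to stratum size rather than take a fixed count from each.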
  • 7. What does the p-value indicate in hypothesis testing?
A) The probability of obtaining results at least as extreme as the observed results, assuming the null hypothesis is true.
B) The confidence interval of the estimate.
C) The sample size required for the study.
D) The strength of the relationship between variables.
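
As an illustration for question 7, the sketch below computes an exact one-sided p-value for a binomial experiment. The scenario (9 successes in 10 trials, tested against a null success probability of 0.5) and the function name `binomial_p_value_upper` are assumptions made for the example.

```python
from math import comb


def binomial_p_value_upper(successes, n, p_null=0.5):
    """P(X >= successes) when X ~ Binomial(n, p_null), i.e. under the null hypothesis."""
    return sum(
        comb(n, k) * p_null**k * (1 - p_null) ** (n - k)
        for k in range(successes, n + 1)
    )


# Hypothetical data: 9 of 10 patients improved; the null hypothesis says each improves with probability 0.5.
p_value = binomial_p_value_upper(9, 10)
print(p_value)
```

Here the p-value is 11/1024, about 0.011. This is the chance of seeing 9 or more successes if the null hypothesis were true. It is not the probability that the null hypothesis itself is true.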
  • 8. What is sensitivity in the context of diagnostic testing?
A) The proportion of true negative results among all individuals without the condition.
B) The proportion of false negative results.
C) The proportion of true positive results among all individuals with the condition.
D) The proportion of false positive results.
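
As an illustration for question 8, the sketch below computes sensitivity and specificity from the four cells of a diagnostic test's 2x2 table. The counts are invented for the example.

```python
def sensitivity(true_pos, false_neg):
    """True positives divided by everyone who actually has the condition."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """True negatives divided by everyone who does not have the condition."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts: 100 people with the condition, 200 without.
tp, fn = 90, 10
tn, fp = 160, 40

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(sens, spec)
```

With these counts the sensitivity is 0.9 and the specificity is 0.8.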
  • 9. What is biostatistics also referred to as?
A) Biomathematics
B) Biometry
C) Biomechanics
D) Bioinformatics
  • 10. Which field is closely related to medical statistics?
A) Pharmacology
B) Pathology
C) Epidemiology
D) Biostatistics
  • 11. Who started genetics studies by investigating segregation patterns in pea families?
A) Charles Darwin
B) William Bateson
C) Francis Galton
D) Gregor Mendel
  • 12. Who strongly disagreed with Galton's ideas on heredity?
A) William Bateson
B) Karl Pearson
C) Arthur Dukinfield Darbishire
D) Raphael Weldon
  • 13. Which group supported Mendel's ideas on genetic inheritance?
A) Biometricians
B) Mendelians
C) Darwinists
D) Neo-Darwinians
  • 14. Who developed the ANOVA and p-value concepts?
A) Ronald Fisher
B) Betty Allan
C) J. B. S. Haldane
D) Sewall G. Wright
  • 15. Who developed F-statistics and methods of computing them?
A) J. B. S. Haldane
B) Ronald Fisher
C) Betty Allan
D) Sewall G. Wright
  • 16. What did J. B. S. Haldane's book reestablish as the premier mechanism of evolution?
A) Gene flow
B) Mutation
C) Genetic drift
D) Natural selection
  • 17. Who banned the Friden calculator from his department at Caltech?
A) Thomas Hunt Morgan
B) J. B. S. Haldane
C) Sewall G. Wright
D) Ronald Fisher
  • 18. Which of the following is NOT a basic principle of experimental statistics?
A) Randomization
B) Replication
C) Sample size determination
D) Local control
  • 19. What should guide the formulation of a research question?
A) Cost considerations.
B) The experimental design.
C) An exhaustive literature review.
D) Data analysis perspectives.
  • 20. Which component of research planning involves defining how to ask a scientific question?
A) Data analysis perspectives.
B) Experimental design.
C) The research question.
D) Costs involved.
  • 21. Which principle of experimental statistics helps to eliminate bias?
A) Cost estimation
B) Replication
C) Randomization
D) Local control
  • 22. What is the first step in defining a research question according to the text?
A) Outlining experimental design.
B) Conducting an exhaustive literature review.
C) Estimating costs.
D) Determining data collection methods.
  • 23. In the formula for arithmetic mean, what does '∑' represent?
A) Difference
B) Division
C) Summation
D) Product
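
As an illustration for question 23, the arithmetic mean can be computed directly from its definition: add up the observations and divide by how many there are. The data values below are invented.

```python
def arithmetic_mean(values):
    """Add up all observations, then divide by the number of observations."""
    total = 0
    for x in values:  # this loop plays the role of the ∑ symbol in the formula
        total += x
    return total / len(values)


# Hypothetical observations.
observations = [2, 4, 6, 8]
mean_value = arithmetic_mean(observations)
print(mean_value)
```

For these four values the total is 20, so the mean is 5.0.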
  • 24. Which cloud service provider is mentioned as a tool for statistical analysis in biological data?
A) Amazon Web Services
B) IBM Cloud
C) Google Cloud Platform
D) Microsoft Azure
  • 25. Which software is used for linear algebra computations?
A) SciPy
B) NumPy
C) SageMath
D) LAPACK
  • 26. What does a Pearson correlation coefficient value of -1 indicate?
A) A perfect negative correlation
B) No linear correlation
C) A perfect positive correlation
D) An undefined relationship
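
As an illustration for question 26, the sketch below computes the Pearson correlation coefficient from its definition, using only the standard library. The data are invented so that y falls by exactly 2 for every unit increase in x. The function name `pearson_r` is hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation: covariance scaled by the product of the standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical data lying exactly on a downward-sloping line.
x = [1, 2, 3, 4]
y = [8, 6, 4, 2]
r = pearson_r(x, y)
print(r)
```

Because every point lies on one decreasing straight line, r comes out as -1 for this data.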
  • 27. Which database is dedicated to Arabidopsis thaliana?
A) TAIR
B) dbSNP
C) KEGG
D) Phytozome
  • 28. Which database stores assemblies and annotation files of dozens of plant genomes?
A) Phytozome
B) TAIR
C) dbSNP
D) KEGG
  • 29. What is another term for a scatter plot?
A) Pie chart
B) Scatter chart
C) Line graph
D) Bar diagram
  • 30. What is a scatter plot also known as?
A) Pie chart
B) Scattergram
C) Bar chart
D) Histogram
  • 31. Which distribution was initially used for RNA-Seq counts data but underestimated sample error?
A) Binomial
B) Normal
C) Poisson
D) Negative Binomial
  • 32. What major initiative relates data from DDBJ, EMBL-EBI, and NCBI?
A) World Data Exchange Program
B) International Nucleotide Sequence Database Collaboration (INSDC)
C) Global Genome Initiative
D) Bioinformatics Data Consortium
  • 33. What does a significance level (α) represent in hypothesis testing?
A) The range of values for a confidence interval
B) The acceptable rate of false positives (Type I errors) when deciding statistical significance
C) The probability that the null hypothesis is true
D) The correlation coefficient between two variables
  • 34. Which statistical models are used to perform tests for statistical significance in RNA-Seq data analysis?
A) Linear regression models
B) Chi-square tests
C) ANOVA
D) Generalized linear models
  • 35. Which type of graph is best suited for showing changes over time?
A) Bar chart
B) Line graph
C) Histogram
D) Pie chart
  • 36. What is a genome-wide association study (GWAS) based on?
A) Linkage disequilibrium.
B) Recombination frequency.
C) Quantitative trait loci.
D) Genomic selection.
  • 37. Which biostatistical method has gained popularity for statistical classification?
A) Decision trees
B) Random forests
C) Bootstrapping
D) Re-sampling methods
  • 38. What does marker-assisted selection aim to improve?
A) Genomic selection models.
B) Quantitative trait mapping.
C) Clinical decision support systems.
D) Breeding outcomes in agriculture.
  • 39. Which software package allows for variance component estimation under a general linear mixed model using REML?
A) Orange
B) ASReml
C) CycDesigN
D) SAS
  • 40. How does a well-defined research question benefit the scientific community?
A) By reducing the need for replication.
B) By simplifying data analysis.
C) By minimizing costs.
D) By adding value through novel insights.
  • 41. In a line graph, which axis typically represents time?
A) The vertical axis
B) Both axes equally represent time
C) Time is not represented in a line graph
D) The horizontal axis
  • 42. What is the formula for calculating the total number of observations (N) in a frequency table?
A) N = fi / N
B) N = f1 + f2 + f3 + ... + fn
C) N = fi * N
D) N = fi - N
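
As an illustration for question 42, the sketch below totals the class frequencies of a small frequency table and then derives each class's relative frequency. The blood-group categories and counts are made up for the example.

```python
# Hypothetical frequency table: category -> observed frequency.
frequencies = {"A": 12, "B": 7, "AB": 3, "O": 18}

# Total number of observations: the sum of all class frequencies.
n_total = sum(frequencies.values())

# Relative frequency of each class: its frequency divided by the total.
relative = {category: count / n_total for category, count in frequencies.items()}

print(n_total)
print(relative)
```

Here the total is 40, and the relative frequencies add up to 1.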
  • 43. Which programming language is associated with deep learning and image analysis in bioinformatics?
A) SQL
B) R
C) SAS
D) Python
  • 44. Which programming language is known for its open-source environment and statistical computing capabilities, with packages available on CRAN?
A) R
B) SQL
C) Python
D) MATLAB
  • 45. Which symbol represents the arithmetic mean in mathematical notation?
A) Σ
B) n
C) i
D) x̄
  • 46. What technique considers the perturbation of whole gene sets rather than single genes?
A) Next-generation sequencing
B) Principal component analysis
C) Gene Set Enrichment Analysis (GSEA)
D) Linear discriminant analysis
  • 47. Which aspect of research planning involves determining how to collect data?
A) Cost estimation.
B) Hypothesis testing.
C) Research question formulation.
D) Data collection methods.
  • 48. Which database is used for indexing scientific articles?
A) dbSNP
B) KEGG
C) Gene Ontology
D) PubMed
  • 49. What is the term for high intercorrelation between predictors in biostatistical settings?
A) Dimensionality reduction
B) Principal component analysis
C) Multicollinearity
D) Gene Set Enrichment Analysis
  • 50. Which software is a Java-based tool for machine learning and data mining?
A) R
B) SAS
C) Weka
D) Orange
  • 51. Which biostatistical method helps in reducing dimensionality by transforming predictors into a smaller set of uncorrelated components?
A) Principal component analysis
B) Linear regression
C) Logistic regression
D) Gene Set Enrichment Analysis
  • 52. Which software supports Quantitative Response Assays for regulated environments such as drug testing?
A) Apache Spark
B) Weka
C) SAS
D) PLA 3.0
  • 53. In which field is the design and analysis of clinical trials particularly important?
A) Animal breeding
B) Quantitative genetics
C) Systems medicine
D) Public health
  • 54. Which tool is used for high-level data processing, data mining, and visualization?
A) ASReml
B) PLA 3.0
C) Orange
D) CycDesigN
  • 55. Which mapping algorithm is not commonly used in QTL mapping?
A) Interval Mapping
B) Multiple Interval Mapping
C) Composite Interval Mapping
D) None of the above
  • 56. Which database focuses on SNPs?
A) dbSNP
B) Gene Ontology
C) KEGG
D) PubMed
  • 57. Who introduced histograms as a graphical representation?
A) Karl Pearson
B) Ronald Fisher
C) John Tukey
D) Francis Galton