A) The measure of confidence in the null hypothesis B) The population parameter being tested C) The probability of obtaining results at least as extreme as the observed results, given that the null hypothesis is true D) The significance level for accepting the null hypothesis
A) t-test B) Wilcoxon signed-rank test C) Kruskal-Wallis test D) Mann-Whitney U test
A) To identify outliers in a dataset B) To examine the relationship between variables C) To test for differences in means D) To summarize categorical data
A) The variability within groups B) The spread of the data C) The central tendency of a dataset D) The strength and direction of a linear relationship between two variables
A) To predict future data points B) To estimate the range within which the population parameter is likely to fall C) To determine the probability of an event occurring D) To compare two independent groups
A) Simple random sampling B) Cluster sampling C) Convenience sampling D) Systematic sampling
A) Polynomial regression. B) Ridge regression. C) Linear regression. D) Logistic regression.
A) The measure of correlation between two variables B) The probability of rejecting the null hypothesis when it is actually true C) The level of confidence in the alternative hypothesis D) The margin of error in the sample mean
A) Regression analysis. B) Factor analysis. C) Time series analysis. D) Cluster analysis.
A) Chi-square test. B) T-test. C) Regression analysis. D) ANOVA.
A) William Sealy Gosset B) John Tukey C) Carlo Lauro D) RAND Corporation
A) Only in data science. B) Strictly within computational linguistics. C) Exclusively in social data science. D) Econometrics.
A) To calculate the range of a dataset B) To compare two different samples C) To state that the sampling distribution of the sample mean approaches a normal distribution as the sample size increases D) To determine the variability within groups
A) Kernel density estimation B) Markov chain Monte Carlo methods C) Artificial neural networks D) Monte Carlo method simulation
A) Imputation. B) Feature engineering. C) Normalization. D) Outlier detection.
A) A statement that there is no significant difference between specified populations B) The hypothesis that the researcher believes to be true C) The hypothesis that is tested using a one-tailed test D) A statement that predicts an outcome in an experiment
A) Kernel density estimation B) Artificial neural networks C) The jackknife method. D) Markov chain Monte Carlo methods
A) Regression analysis B) ANOVA C) T-test D) Chi-square test
A) Numerical integration B) Bayesian updating C) Optimization D) Generating draws from a probability distribution
A) Bootstrap method B) Markov Chain Monte Carlo C) Monte Carlo method D) Maximum likelihood estimation
A) Computational physics. B) Culinary arts. C) Classical music composition. D) Traditional painting techniques.
A) Exact analytical solutions B) Numerical integration C) Generating draws from a probability distribution D) Optimization
A) A random sample B) A probability density C) A likelihood function D) An error function
A) International Linguistics Society. B) International Association for Statistical Computing. C) American Medical Association. D) World Health Organization.
A) Monte Carlo simulation device B) John Tukey’s jackknife C) ERNIE D) RAND Corporation tables
A) Focusing solely on small sample sizes. B) Avoiding the use of computers in statistical analysis. C) Developing new mathematical theories without practical application. D) Transforming raw data into knowledge using computer-intensive methods.
A) Correlation measures the strength of a relationship, while causation measures the direction B) Correlation indicates a relationship between variables, while causation implies one variable causes a change in the other C) Correlation refers to linear relationships, while causation refers to non-linear relationships D) Correlation is used for categorical data, while causation is used for continuous data |