Practice: M11 — Introduction to Big Data Techniques

Module: M11 Glossary: M11 Terms


Question 1: “Fintech” is best described as:

A. A technology-driven innovation in the financial service industry B. The collection of large quantities of financial data from a variety of sources in multiple formats C. The use of technical models to describe patterns in financial markets and make trading decisions


Question 2: Which of the following statements is true in the use of Machine Learning (ML)?

A. Some techniques are termed “black box” due to data biases B. Human judgment is not needed because algorithms continuously learn from data C. Training data can be learned too precisely, resulting in inaccurate predictions when used with different datasets


Question 3: Text analytics is appropriate for application to:

A. Large, structured datasets B. Public but not private information C. Identifying possible short-term indicators of coming trends


Question 4: The three characteristics of Big Data are best described as:

A. Volume, Velocity, Variety B. Volume, Variance, Visualization C. Validity, Velocity, Value


Question 5: Machine learning that uses labeled training data to predict outcomes for new datasets is called:

A. Supervised learning B. Unsupervised learning C. Deep learning


Question 6: An ML model performs excellently on training data but poorly on new data. This is most likely due to:

A. Underfitting B. Overfitting C. Black box problem


Question 7: Which is not a traditional source of financial data?

A. Government economic statistics B. Social media posts and online reviews C. Company annual reports and filings


Question 8: The correct order of the Big Data processing pipeline is:

A. Capture → Storage → Curation → Search → Transfer B. Capture → Curation → Storage → Search → Transfer C. Curation → Capture → Search → Storage → Transfer