Q1. What is the term for the process of analyzing large and complex datasets to uncover patterns, trends, and insights?
Data Visualization
Data Analysis
Data Warehousing
Data Mining
Q2. What is the term for a large volume of data that cannot be processed using traditional database techniques?
Big Data
Mega Data
Massive Data
Huge Data
Q3. What is Hadoop named after?
Doug Cutting's high school best friend
The toy elephant belonging to creator Doug Cutting's son
A sound Doug Cutting's laptop made during Hadoop development
Creator Doug Cutting's favourite circus act
Q4. Which of the following is not a component of the Hadoop ecosystem?
MapReduce
Spark
YARN
HDFS
Q5. Which technology is commonly used for distributed data storage in Big Data systems?
HDFS
MongoDB
SQL
Cassandra
Q6. Hadoop is a framework that is used with several related tools. Which of the following are its common cohorts?
MapReduce, Heron, and Trumpet
MapReduce, MySQL, and Google Apps
MapReduce, Hive, and HBase
MapReduce, Hummer, and Iguana
Q7. Which programming model is used to develop Hadoop-based applications that can process massive amounts of data?
Mahout
MapReduce
Oozie
None of the above
Q8. What is the term for a centralized repository that stores large volumes of raw data in its native format?
Data Stream
Data Lake
Data Pond
Data Reservoir
Q9. Which of the following is not a characteristic of Big Data?
Velocity
Variety
Volume
Validity
Q10. Which of the following is not a characteristic of a data warehouse?
Real-time processing
Integrated data
Historical data
Optimized for analytics
Q11. Which technology framework is commonly used for distributed storage and processing of Big Data?
Flink
Kafka
Hadoop
Spark
Q12. Which of the following is not a challenge associated with Big Data?
Scalability
Privacy
Data Consistency
Security
Q13. Which type of database is optimized for handling transactional workloads and providing high availability?
NewSQL
OLTP
OLAP
NoSQL
Q14. Which of the following is not a layer of the Big Data stack?
Processing Layer
Storage Layer
Application Layer
Presentation Layer
Q15. How many V's are there in Big Data?
4
5
3
2
Q16. What type of data is a bank's transaction data?
Structured data
Unstructured data
Both of the above
None of the above
Q17. Which technology is commonly used for real-time data analytics and visualization?
QlikView
Databricks
Power BI
Tableau
Q18. Which of the following is not a data type commonly encountered in Big Data?
CSV
XML
Binary
JSON
Q19. Which of these has the world's largest Hadoop cluster?
All of these
Datamatics
Facebook
Apple
Q20. Which of the following is not a key feature of Apache Spark?
In-memory Computing
Batch Processing
MapReduce Support
Real-time Processing
Q21. Which technology is commonly used for real-time stream processing in Big Data systems?
Flink
Spark
Hadoop
Kafka
Q22. Which technology is commonly used for distributed messaging in Big Data systems?
Hadoop
Spark
Kafka
Flink
Q23. Which of these Hadoop-based projects does Facebook use to tackle Big Data?
Prism
Project Data
Project Prism
Project Big
Q24. In how many forms can Big Data be found?
1
2
3
4
Q25. Which type of data refers to data that is generated in real-time or near real-time?
Structured Data
Semi-Structured Data
Streaming Data
Unstructured Data
Q26. Which of the following does not accurately describe Hadoop?
Distributed computing approach
Java-based
Open-source
Real-time
Q27. What is the term for the process of cleaning and transforming raw data into a usable format for analysis?
Data Staging
Data Cleansing
Data Scrubbing
Data Preparation
Q28. What is the term for the process of storing data across multiple servers to ensure redundancy and fault tolerance?
Data Replication
Data Partitioning
Data Redundancy
Data Sharding
Q29. What is the term for the process of integrating data from multiple sources to create a unified view?
Data Normalization
Data Integration
Data Fusion
Data Aggregation
Q30. Data of what size, in bytes, is known as Big Data?
Meta
Tera
Giga
Peta