Q1. Which technology framework is commonly used for distributed storage and processing of Big Data?
Flink
Spark
Hadoop
Kafka
Q2. How many V's are there in Big Data?
5
4
2
3
Q3. Which technology is commonly used for distributed data storage in Big Data systems?
MongoDB
Cassandra
SQL
HDFS
Q4. Which of these has the world's largest Hadoop cluster?
Facebook
Datamatics
All of the above
Apple
Q5. Which of the following is not a key feature of Apache Spark?
In-memory Computing
Batch Processing
Real-time Processing
MapReduce Support
Q6. Which type of database is optimized for handling transactional workloads and providing high availability?
NoSQL
NewSQL
OLTP
OLAP
Q7. Which technology is commonly used for distributed messaging in Big Data systems?
Hadoop
Kafka
Flink
Spark
Q8. Hadoop is a framework. It is used with several types of related tools. What are its common cohorts?
MapReduce, Heron, an Trumpet
MapReduce, MySQL, and Google Apps
MapReduce, Hummer, and Iguana
MapReduce, Hive, and HBase
Q9. Which of these options is Hadoop named after?
The toy elephant of Creator Cutting's son
Creator Doug Cutting's favourite circus act
A sound Cutting's laptop made during Hadoop development
Cutting's high school best friend
Q10. What is the term for the process of storing data across multiple servers to ensure redundancy and fault tolerance?
Data Redundancy
Data Replication
Data Sharding
Data Partitioning
Q11. Which of the following is not a challenge associated with Big Data?
Security
Privacy
Data Consistency
Scalability
Q12. What can be described as a model for programming used to develop applications based on Hadoop that can process massive amounts of data?
MapReduce
None of the above
Oozie
Mahout
Q13. Which of the following is not a component of the Hadoop ecosystem?
HDFS
Spark
MapReduce
YARN
Q14. Which technology is commonly used for real-time stream processing in Big Data systems?
Hadoop
Spark
Flink
Kafka
Q15. What is the term for a large volume of data that cannot be processed using traditional database techniques?
Huge Data
Mega Data
Big Data
Massive Data
Q16. Which of the following is not a data type commonly encountered in Big Data?
XML
JSON
Binary
CSV
Q17. What is the transaction data of the bank?
Both 1 and 2
Structured data
Unstructured data
None of the above
Q18. Which technology is commonly used for real-time data analytics and visualization?
Power BI
Tableau
QlikView
Databricks
Q19. Which of the following is not a characteristic of Big Data?
Volume
Variety
Velocity
Velocity
Q20. Which of the following is not a characteristic of a data warehouse?
Optimized for analytics
Real-time processing
Integrated data
Historical data
Q21. Which of these projects based on Hadoop is used by Facebook to tackle with Big Data?
Prism
Project Prism
Project Data
Project Big
Q22. Which of the following is not a layer of the Big Data stack?
Processing Layer
Application Layer
Presentation Layer
Storage Layer
Q23. What is the term for the process of cleaning and transforming raw data into a usable format for analysis?
Data Cleansing
Data Staging
Data Preparation
Data Scrubbing
Q24. What is the term for the process of analyzing large and complex datasets to uncover patterns, trends, and insights?
Data Visualization
Data Warehousing
Data Analysis
Data Mining
Q25. All the options given accurately describe Hadoop except one. Which one is it?
Open-source
Real-time
Java-based
Distributed computing approach
Q26. Big Data can be found in how many versions?
3
4
2
1
Q27. What is the term for a collection of data that is too large to be processed using traditional database techniques?
Data Pond
Data Lake
Data Reservoir
Data Stream
Q28. What is the term for the process of integrating data from multiple sources to create a unified view?
Data Normalization
Data Aggregation
Data Integration
Data Fusion
Q29. Data is what size of bytes is known as Big Data?
Giga
Tera
Peta
Meta
Q30. Which type of data refers to data that is generated in real-time or near real-time?
Semi-Structured Data
Structured Data
Streaming Data
Unstructured Data