IT350M6-6: Explore non-relational database alternatives
Big Data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management
tools or traditional data processing applications. The challenges include capture, curation, storage, search, sharing, transfer, analysis and
visualization. The trend to larger data sets is due to the additional information derivable from analysis of a single large set of related data, as
compared to separate smaller sets with the same total amount of data, allowing correlations to be found to delineate business trends, determine
quality of research, prevent diseases, link legal citations, combat crime, and determine real-time roadway traffic conditions.
Big Data analytics is a topic fraught with both positive and negative potential. Big Data is defined not just by the amount of information involved but
also its variety and complexity, as well as the speed with which it must be analyzed or delivered. The amount of data being produced is already
incredibly great, and current developments suggest that this rate will only increase in the near future. Improved service should result as companies
better understand their customers, but it is also possible that this data will create privacy problems. Thus, Big Data is important not only to students
who hope to gain employment using these techniques and those who plan to use it for legitimate research, but also for everyone who will be living
and working in the 21st Century.
Assessment Instructions
This competency assessment is divided into two tasks covering non-relational database facets. You will generate two separate report documents,
each addressing a specific task, for this assessment.
It is very important that you watch the Module 6 videos associated with SQL prior to completing the assessment. You will need to install and use
Microsoft SQL Server Express and Microsoft SQL Server Management Studio (SSMS) for this course. You can download the latest versions of
these free software products here:
Microsoft SQL Server Express
Microsoft SSMS
. Navigate to the Academic Tools area of this Module and select Library then Required Readings to access your texts and videos. You will need to
install and use Microsoft SQL Server Express and Microsoft SQL Server Management Studio (SSMS) for this course.
Task 1 – Big Data Use Cases
Perform research on Big Data use cases via the Internet and Purdue University Global library. Use the article at the following website as a starting
point for your research:
Big Data Use Cases
Select one use case from the list below to be the topic of your paper.
1. 360° View of the Customer
2. Fraud Prevention
3. Security Intelligence
4. Data Warehouse Offload
5. Price Optimization
6. Operational Efficiency
7. Recommendation Engines
8. Social Media Analysis and Response
9. Preventive Maintenance and Support
10. Internet-of-Things (IoT)
Write a 3-page expository paper, not including title page or references, that addresses the following:
Describe the use case and how it makes use of Big Data.
2022/01/07 14:38 Purdue University Global
https://purdueglobal.brightspace.com/d2l/le/content/198702/viewContent/13257482/View 2/3
Explain the V’s of Big Data within the context of your chosen use case.
Volume
Velocity
Variety
Veracity
Task 2 – Exploring the Hadoop Environment
You will download and install software products that will allow you to use and explore the Hadoop environment. You will then perform tive specified
exercises with the installed environment.
Cloudera is a software company that provides a platform for data analytics, data warehousing, and machine learning. Initially, Cloudera started as
an open-source Apache Hadoop distribution project, commonly known as Cloudera Distribution for Hadoop or CDH. It contains Apache Hadoop and
other related projects where all the components are 100% open-source under Apache License.
The Cloudera QuickStart virtual machine (VM) includes everything that you would need for using CDH, including Impala, Cloudera Search, and
Cloudera Manager. The Cloudera QuickStart VM uses a package-based install that allows you to work with or without the Cloudera Manager. It has
a sample of Cloudera’s platform for “Big Data.”
You are required to complete the subtasks listed below. Generate a Microsoft Word report incorporating the specified artifacts from the subtask
work.
Task 2.1 – Install Oracle VirtualBox and the Cloudera QuickStart VM
Install Oracle VirtualBox and the Cloudera QuickStart VM using the following guidance document:
Installation Instructions for the Cloudera Quickstart Virtual Machine
Take screen captures to prove that you accomplished the installation tasks. Incorporate the screen captures into your Microsoft Word assessment
document. Describe your experience with installing and operating these software programs via a minimum of two paragraphs of content.
Task 2.2 – Complete Tutorial Exercise 1
Complete Exercise 1 (pages 1-11) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.3 – Complete Tutorial Exercise 2
Complete Exercise 2 (pages 12-20) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.4 – Complete Tutorial Exercise 3
Complete Exercise 3 (pages 21-26) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
Task 2.5 – Complete Tutorial Exercise 4
Complete Exercise 4 (pages 27-36) contained in the following tutorial document:
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content.
The exercise entailed examination of log records, which indicated the occurrence of distributed denial-of-service (DDoS) attacks. Describe what
DDoS is and how it can be damaging to an organization via a minimum of one paragraph of content.
Task 2.6 – Complete Tutorial Exercise 5
Complete Exercise 5 (pages 37-43) contained in the following tutorial document:
2022/01/07 14:38 Purdue University Global
https://purdueglobal.brightspace.com/d2l/le/content/198702/viewContent/13257482/View 3/3
Cloudera Quickstart Beginner Tutorial
Take screen captures to prove that you completed this exercise. Incorporate the screen captures into your Microsoft Word assessment document.
Describe your experiences in completing this tutorial exercise via a minimum of two paragraphs of content. Also, provide the benefits of using data
visualizations like that established in this exercise via a minimum of one paragraph of content.
Collepals.com Plagiarism Free Papers
Are you looking for custom essay writing service or even dissertation writing services? Just request for our write my paper service, and we'll match you with the best essay writer in your subject! With an exceptional team of professional academic experts in a wide range of subjects, we can guarantee you an unrivaled quality of custom-written papers.
Get ZERO PLAGIARISM, HUMAN WRITTEN ESSAYS
Why Hire Collepals.com writers to do your paper?
Quality- We are experienced and have access to ample research materials.
We write plagiarism Free Content
Confidential- We never share or sell your personal information to third parties.
Support-Chat with us today! We are always waiting to answer all your questions.