The following free sample questions cover the IBM C2090-102 (IBM Big Data Architect) certification exam:
NEW QUESTION 1
Company A is searching for a browser-based visualization tool to perform analysis on vast amounts of data in any structure. They want to execute operations such as pivot, slice and dice, among others. Which of the following would meet these requirements?
- A. Streams
- B. BigSheets
- C. Aginity Workbench
- D. Watson Explorer
Answer: B
Explanation:
References:
http://www.dotgroup.co.uk/wp-content/uploads/2014/11/Harness-the-Power-of-Big-Data-The-IBM-Big-DataPlatform.pdf Page: 132
NEW QUESTION 2
Which of the following statements regarding Big R is TRUE?
- A. Missing data values must be handled by ETL processes prior to analyzing data with Big R
- B. A bigr.frame loads data in memory for optimal performance
- C. A Big R user is responsible for parallelizing the execution of the R functions being used in the R program
- D. Performing a mathematical operation on a Big R vector variable will automatically loop through each item in the vector
Answer: D
Explanation:
Reference:
http://www.computerworld.com/article/2497319/business-intelligence-beginner-s-guide-to-r-syntax-quirks-you-llwant-to-know.html
NEW QUESTION 3
Considering a service level requirement (SLR) of less than 3 milliseconds, what task must be performed to meet the SLR?
- A. Collect and analyze SNMP MIB data
- B. Survey a representative amount of end users
- C. Measure switch failure frequency
- D. Inquire when prime network hours occur
Answer: C
NEW QUESTION 4
In designing a new Hadoop system for a customer, the option of using SAN versus DAS was brought up. Which of the following would justify choosing SAN storage?
- A. SAN storage provides better performance than DAS
- B. SAN storage reduces and removes a lot of the HDFS complexity and management issues
- C. SAN storage removes the Single Point of Failure for the NameNode
- D. SAN storage supports replication, reducing the need for 3-way replication
Answer: D
NEW QUESTION 5
A smart meter project is being undertaken by an electric utility headquartered in San Francisco. They are looking for technology for monitoring thousands of meters, multiple times within an hour, and performing corresponding downstream advisory action to influence usage behavior. Which of the following would you recommend to function at the heart of this process?
- A. Hadoop
- B. SPSS
- C. Reporting engine and a portal
- D. InfoSphere Streams
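Answer: D
Explanation:
InfoSphere Streams analyzes data in motion, which fits monitoring thousands of meters several times an hour and acting on the results right away. SPSS is a statistical modeling tool rather than a stream-processing engine.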
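The monitoring requirement in this question (many meters, frequent readings, an advisory action triggered as data arrives) can be pictured with a small sketch. This is plain Python, not InfoSphere Streams or any IBM API; the window size, the threshold, and the `advisories` function are hypothetical names chosen only for illustration.

```python
from collections import defaultdict, deque

WINDOW = 4           # hypothetical: number of recent readings kept per meter
THRESHOLD_KWH = 2.0  # hypothetical: rolling average that triggers an advisory


def advisories(readings, window=WINDOW, threshold=THRESHOLD_KWH):
    """Process readings one at a time and yield (meter_id, rolling_average)
    whenever a meter's rolling average exceeds the threshold."""
    recent = defaultdict(lambda: deque(maxlen=window))
    for meter_id, kwh in readings:
        history = recent[meter_id]
        history.append(kwh)  # the oldest reading drops out once the window is full
        average = sum(history) / len(history)
        if average > threshold:
            yield meter_id, average


# Made-up sample readings: (meter id, kWh)
sample = [("m1", 1.0), ("m2", 2.5), ("m1", 1.2), ("m2", 0.5), ("m1", 5.0)]
alerts = list(advisories(sample))
for meter_id, average in alerts:
    print(f"advise {meter_id}: rolling average {average:.2f} kWh")
```

The point of the sketch is that each reading is evaluated as it arrives, with only a small per-meter window kept in memory, rather than being stored first and analyzed later in a batch.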
NEW QUESTION 6
Which of the following statements regarding Big R is TRUE?
- A. Unless specified otherwise, Big R automatically assumes all data to be integers
- B. Big R’s ‘bigr.frame’ is equivalent to R’s ‘data.frames’
- C. When you execute Big R “apply” function, Big R transparently extracts data out of HDFS into the Big R engine
- D. A data analyst using Big R employs Map Reduce programming principles
Answer: A
Explanation:
Reference:
https://developer.ibm.com/hadoop/docs/biginsights-value-add/big-r/bigr-tutorial/
NEW QUESTION 7
Faced with a wide area network implementation, you have a need for asynchronous remote updates. Which one of the following would best address this use case?
- A. GPFS Active File Management allows data access and modifications even when remote storage cluster is unavailable
- B. HDFS Cluster rebalancing is compatible with data rebalancing schemes. A scheme might automatically move data from one DataNode to another if the free space on a DataNode falls below a certain threshold
- C. GPFS File clones can be created from a regular file or a file in a snapshot using the mmclone command
- D. HDFS NameNode: The NameNode keeps an image of the entire file system namespace and file Blockmap in memory. This key metadata item is designed to be compact, such that a NameNode with 4 GB of RAM is plenty to support a huge number of files and directories
Answer: C
Explanation:
Reference:
http://www-01.ibm.com/support/knowledgecenter/STXKQY_4.1.1/com.ibm.spectrum.scale.v4r11.adv.doc/bl1adv_clones.htm
NEW QUESTION 8
In a typical Hadoop HA cluster, two separate machines are configured as which of the following?
- A. Data Nodes
- B. Edge Nodes
- C. Name Nodes
- D. None of the Above
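Answer: C
Explanation:
The HDFS High Availability documentation referenced below describes a typical HA cluster as separate machines configured as NameNodes, one Active and the others in Standby.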
Reference:
http://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-hdfs/HDFSHighAvailabilityWithQJM.html
NEW QUESTION 9
For company B, 85% of their analytics queries only involve about 25% of their data; another 10% of the queries will touch 35% of the rest of the data, and only 5% of the queries will touch the remaining 40% of the data. The estimated volume is 50 TB growing at 1 TB per year. Which of the following would provide the best value (business benefit) and lowest TCO?
- A. Place the entire set of data in a data warehouse with proper partitioning and indexing
- B. Place the entire set of data in a Hadoop environment using commodity HW
- C. Place the top 25% of data (used by 85% of the queries) in a Hadoop environment, and the rest in a data warehouse
- D. Place the top 25% of data (used by 85% of the queries) in a data warehouse, and the rest in a Hadoop environment
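Answer: D
Explanation:
The small, heavily queried portion of the data benefits most from data warehouse performance, while the larger, rarely queried remainder is cheaper to keep on commodity Hadoop storage.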
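The percentages in this question translate into concrete volumes. The sketch below works through the arithmetic under one stated assumption: the three data shares (25%, 35%, 40%) are read as fractions of the total 50 TB, since they add up to 100%. The tier names and the `tier_sizes` function are hypothetical labels, not terms from the exam.

```python
TOTAL_TB = 50  # estimated volume from the question

# (tier name, percent of queries, percent of total data)
# Assumption: "35% of the rest" is read as 35% of the total volume.
TIERS = [
    ("hot", 85, 25),
    ("warm", 10, 35),
    ("cold", 5, 40),
]


def tier_sizes(total_tb, tiers):
    """Return the data volume in TB held by each tier."""
    return {name: total_tb * data_pct / 100 for name, _, data_pct in tiers}


sizes = tier_sizes(TOTAL_TB, TIERS)
for name, query_pct, _ in TIERS:
    print(f"{name}: {sizes[name]} TB serves {query_pct}% of queries")
```

Under that reading, 12.5 TB of data serves 85% of the queries, while the remaining 37.5 TB (17.5 TB plus 20 TB) serves only 15% of them.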
NEW QUESTION 10
A major telecommunication company has millions of customers. Most of their customers are prepaid. Being prepaid customers, they can very easily switch to other vendors. Over the last four to six months, this company has lost a considerable number of customers to the competition. They intend to build a system that can provide them with insight into the customer’s social network (e.g. who is the influencer and who is the follower). They also want the ability to monitor the voice and data usage patterns in real time and they want the system to be trained over time to predict possible dissatisfactions. Given this scenario, which one of the following would you recommend?
- A. Hadoop
- B. Spark
- C. Cloudant
- D. Netezza
Answer: B
NEW QUESTION 11
Which of the following statements is TRUE regarding Cloud deployment models?
- A. Performance and scalability requirements are a critical factor for deciding between Platform as a Service and Infrastructure as a Service deployment models
- B. In a Platform as a Service offering, the customer has root access to the servers
- C. Applications with extremely high transaction volumes are good candidates for Platform as a Service
- D. In an Infrastructure as a Service deployment, the cloud provider provides security patching, monitoring and failover capabilities
Answer: A
NEW QUESTION 12
Company A has decided to implement a new data system to support their rapidly growing business. They have an existing 20 TB worth of raw data, with an expected weekly incoming rate of 50 GB of new raw data. The data is mostly text based and unstructured. A typical query can involve pulling in 10 GB of data. Historically, performance has been an issue and currently needs to be addressed. Which of the following would you suggest to support these requirements?
- A. Set up a Hadoop system with commodity HW for scalability
- B. Utilize de-duplication and compression technology
- C. Use a mixture of different disk-types to provide hot/cold storage
- D. Create range partitions for the data
Answer: A
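The figures in this question can be turned into a rough sizing estimate. The sketch below is back-of-the-envelope arithmetic only: the decimal conversion (1 TB = 1000 GB) and the per-disk throughput of 100 MB/s are assumptions made for illustration and do not come from the question, and both function names are hypothetical.

```python
INITIAL_TB = 20   # existing raw data, from the question
WEEKLY_GB = 50    # incoming raw data per week, from the question
QUERY_GB = 10     # data pulled by a typical query, from the question
GB_PER_TB = 1000  # assumption: decimal units


def projected_tb(weeks):
    """Raw data volume in TB after the given number of weeks."""
    return INITIAL_TB + weeks * WEEKLY_GB / GB_PER_TB


def scan_seconds(data_gb, disks, mb_per_s_per_disk=100):
    """Idealized time to read data_gb spread evenly over `disks` disks.
    The default throughput is an assumed figure, not a measurement."""
    return data_gb * 1000 / (disks * mb_per_s_per_disk)


print(f"volume after one year: {projected_tb(52):.1f} TB")
print(f"10 GB scan on 1 disk: {scan_seconds(QUERY_GB, 1):.0f} s")
print(f"10 GB scan on 10 disks: {scan_seconds(QUERY_GB, 10):.0f} s")
```

With these assumptions the data grows by about 2.6 TB per year, and spreading a 10 GB scan over ten disks cuts the idealized read time from 100 seconds to 10, which is the kind of scale-out effect the question is probing.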
NEW QUESTION 13
A company has to design a new data system. They will need to support several OLTP applications. Every three days a batch job will run to load specific data into a set of 10 large tables (with historical data) where OLAP analytics will be performed. Performance for both OLTP and OLAP queries is important. Which of the following designs would you suggest to the company?
- A. Use a NoSQL data store such as MongoDB or Cloudant on the cloud to provide needed scalability
- B. Use DB2 Data Partition Feature (DPF), partitioning all tables into different partitions
- C. Use DB2 Data Partition Feature (DPF), only partitioning the 10 tables where Analytics will be run
- D. Use DB2 with BLU Acceleration, use columnar store for the 10 tables where Analytics will be run
Answer: D
NEW QUESTION 14
You have implemented a large Hadoop MapReduce cluster and the applications and users are multiplying. You are now faced with requests for interactive and streaming data applications while you still need to support the original MapReduce batch jobs. Select the best option for continued support and performance.
- A. Just add several data nodes as Hadoop clusters are designed to scale up easily
- B. Keep your original cluster configuration, all that is needed is re-optimizing the Oozie workflow management
- C. Implement YARN to decouple MapReduce and resource management
- D. Implement Apache Cassandra to automatically optimize multi-tenancy workloads
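Answer: C
Explanation:
YARN separates cluster resource management from the MapReduce programming model, so batch, interactive, and streaming applications can share the same cluster. Apache Cassandra is a NoSQL database, not a workload scheduler.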
NEW QUESTION 15
A bank wants to build a system that tracks all ATM and online transactions in real time. They want to build a personalized model of their customer’s financial activity by incorporating enterprise data as well as social media data. The system must be able to learn and adapt over a period of time. These personalized models will be used for real-time promotions as well as for any fraud or crime detection. Given these requirements, which of the following would you recommend?
- A. Spark
- B. Hadoop
- C. Cloudant
- D. Netezza
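Answer: A
Explanation:
The scenario combines real-time processing with models that learn over time, which matches Spark's streaming and machine learning libraries. Netezza is a data warehouse appliance aimed at analytics on data at rest.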
NEW QUESTION 16
The AQL query language is the easiest and most flexible tool to pull structured output from which of the following?
- A. Hive data structures
- B. Unstructured text
- C. HBase schemas
- D. JDBC connected relational data marts
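Answer: B
Explanation:
AQL (Annotation Query Language) is the text analytics language in BigInsights; it is used to extract structured information from unstructured text.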
Reference:
http://www.ibm.com/developerworks/library/bd-sqltohadoop2/
NEW QUESTION 17
The YARN Resource Managers (RM) have an option to embed an ActiveStandbyElector to decide which RM should be the Active. Upon which of the following is it based?
- A. Job Tracker
- B. Zookeeper
- C. Task Tracker
- D. Name Node
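Answer: B
Explanation:
The embedded ActiveStandbyElector is ZooKeeper-based. Job Tracker and Task Tracker are MapReduce v1 components and play no part in ResourceManager failover.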
NEW QUESTION 18
Which of the following is NOT a valid Big Data platform integration?
- A. Platform plugins
- B. Intraplatform integration
- C. Enterprise integration with other repositories
- D. Network integration
Answer: D
NEW QUESTION 19
Which of the following statements is TRUE regarding cloud based solutions?
- A. In a Platform as a Service Cloud deployment, the customer chooses the operating system they want to use
- B. Automated recovery from hardware or network failures is not possible in a public cloud implementation, only in a private cloud
- C. There are benefits to using the cloud even for small-scale applications
- D. Using firewalls to create network boundaries is sufficient for ensuring cloud security
Answer: C
Explanation:
References:
http://www.ibm.com/developerworks/cloud/library/cl-cloudappdevelop/
NEW QUESTION 20
BigInsights is a solution that accomplishes which of the following?
- A. Replaces the traditional Data warehouses
- B. Can exchange information with the traditional Data warehouses only
- C. Includes a connector that enables data exchange between a BigInsights cluster and Netezza appliance in only one way
- D. Supports data exchange with a number of sources
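Answer: D
Explanation:
BigInsights is positioned to complement data warehouses rather than replace them, and it provides connectors for several data sources, including relational databases and Netezza.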
Reference:
https://www-01.ibm.com/support/knowledgecenter/SSPT3X_2.1.1/com.ibm.swg.im.infosphere.biginsights.install.doc/Install.pdf