Data Warehousing Test Answers 2015



Which of the following implements Multidimensional OLAP?
All of the Above
Microsoft SSAS
Oracle Essbase
IBM TM1


Which of the following is true about Data Scrubbing?
It involves data cleansing
Harmonization of data cannot be considered as Data Scrubbing
It is same as data validation
Standardization of data cannot be considered as Data Scrubbing


When is it appropriate to use a Joiner Transformation during ETL?
When the two sources have a primary-key to foreign-key relationship
When the two sources have multiple matching columns
When the two sources are heterogeneous
All of the Above


Which MDX operation returns all possible combinations of one or more sets?
Leaves
AllMembers
Distinct
Crossjoin


What is an example of an OLAP dice operation?
randomizing the year and region
randomizing the year
specifying a particular year and region
specifying a particular year


Which of the following systems is optimized for multidimensional analysis?
OLTP
OLAP
SQL
ETL


Which system is appropriate for recording customer transactions in real time?
Star Schema
Snowflake Schema
OLAP
OLTP


Which of the following is true about Data Warehousing?
It is designed to facilitate reporting and analysis
All of the given options are true
It is a non-volatile time-variant repository
It is copy of transaction data specifically structured for query and analysis


Which of the following is true about Snowflake schema?
It is represented by centralized fact tables
It is a variation of the star schema
Its a logical arrangement of tables in a multidimensional database
All of the above


What is the process by which raw data is migrated into a data warehouse?
Export, Transmit, Load
Extract, Transform, Load
Export, Translate, Load
Extract Test Language


Which of the following uses the idea of a data cube to represent the dimensions of data ?
OLAP or Multidimensional Database
ODS
Relational Database
OLTP


Data Warehouse (OLAP) is based on which of the following?
Fact Constellation Schema
Star Schema
Snowflake Schema
All of given options are valid


Which of the following are valid Logical Extraction methods?
Online extraction
Full Extraction
Both Full Extraction and Incremental Extraction
Incremental Extraction


Which of the following is TRUE about MOLAP?
Data is stored in a multidimensional cube.
MOLAP cubes are built for fast data retrieval
All of the given options are valid
MOLAP cubes are optimal for slicing and dicing operations


Which of the following is true about Data mining?
All of the given options are valid
It is the process of analyzing data from different perspectives and summarizing it into useful information
It is the process of finding correlations or patterns among various fields
It allows users to analyze data from many different dimensions or angles


Which of the following is VALID about time-variant characteristic of a data warehouse?
Time horizon of a data warehouse is significanlty longer than that of operational systems
All of the given options are valid
Data warehouses contain data that is generally loaded from the operational databases on a regular interval
Data Warehouse maintains both historical and (nearly) current data


Which of the following is TRUE about ROLAP?
All of the given options are valid
Can leverage functionalities inherent in the relational database
Performance can be slow
Can handle large amounts of data


Operational Database(OLTP) is based on which of the following?
Snowflake Schema
Fact Constellation Schema
Star Schema
Entity Relationship Model


What table is at the center of a star schema?
fact table
dimension table
core table
metadata table


What is the "grain" of a fact table?
the raw data from which the facts are derived
the ratio of facts to dimensions
the direction along which additive measures can be combined
the most atomic level at which the facts may be defined


Which of the following is true about Specialized SQL servers?
It uses array-based multidimensional storage engines
It facilitates OLTP operations in SQL
It provides advanced query language and query processing support for SQL queries over star and snowflake schemas
It uses the two level of data storage representation to handle dense and sparse data sets


Which of the following operations navigate data from less detailed data to highly detailed data?
Roll-up
Drill-up
Drill-down
Roll-down


A data warehouse is built as a separate repository of data, different from the operational data of an enterprise because?
A data warehouse cannot afford to allow corrupted data within it
It is necessary to keep the operational data free of any warehouse operations
A data warehouse contains summarized data whereas the operational database contains transactional data
It contains data derived from multiple sources


The ‘Slice’ operation deals with?
Merging the cells of all but one dimension
Selecting the cells of any one dimension of data cube
Merging the cells along one dimension
Selecting all but one dimension of the data cube.


Which of the following is true about 'Pivot' operation?
It rotates the data axes in view in order to provide an alternative presentation of data
All of the given options are valid
It is also known as rotation
Two consecutive slice operations in two different dimensions


Modifying an OLAP Cube to view information from only a single year is an example of which operation?
slice
join
dice
pivot


Where does cleansing take place during the ETL phase of data bound for a MOLAP based data warehouse?
Cleanroom Table
Staging Area
ODS
No distinct cleansing phase, data cleansed during MDX queries


Which language is appropriate for querying a non-relational OLAP database?
MDX
SQL
Fortran
SPSS


Which of the following is a method for delta detection during the ETL process?
audit columns
CRC
(none of these choices)
(both of these choices)


Which of the following is an example of a non-aggregatable measure?
costs
probability of default
transaction volume
gross profits


Which of the following is NOT a benefit of data warehousing?
To integrate data from multiple source systems
To maintain data history
To collate data from multiple sources into a multiple databases
To enhance data quality


Choose the most correct statement:
Star and snowflake schema contains two Fact tables
Fact Constellation schema contains two Fact tables
Snowflake schema contains two Fact tables
Star schema contains two Fact tables


What is a Data Mart?
an access layer comprising a subset of a data warehouse
a schema that organizes data into facts and dimensions
an online, open exchange in which organizations can trade business information
a central repository where separate organizations can securely backup data


Which of the following would not be a part of the fact table of a star schema representing sales data?
Units sold
Store UID when a Store dimension table exists
Product name when a Product dimension table exists
None of the Above


Which line is not a valid MDX comment?
/* this line */
-- this line
## this line
// this line


Which of the following environments has a higher amount of Joins
OLTP
Database
OLAP
Data Warehousing env


Which of the following is not a VALID characteristic of data warehouse?
Integrated
Volatile
Subject Oriented
Nonvolatile


What is an OLAP Cube?
a dimensional-reduction operation that summarizes data
None of the Above
a three-dimensional array for Online Analytical Processing
an array in which data is stored and characterized by multiple dimensions


Which of the following can present summary data from an OLAP server?
normalization
primary keys
pivot tables
All of the Above


Which of the following contains historical data?
OLTP
Both OLAP and OLTP
OLAP
Neither OLAP nor OLTP


Which of the following enviroments are in third normal form 3NF?
OLTP
Data Warehousing env
OLAP
All of the options are correct


Which of the following schema is generally the most denormalized?
Reverse Snowflake
Hub and Spoke
Snowflake
Star


Which of the following operation is performed by introducing new dimensions?
Roll-down
Drill-down
Drill-up
Roll-up


Which of the following is a valid disadvantage of a dimensional approach?
Dimensional structures are difficult to understand for business users
The retrieval of data from the data warehouse tends to become slow
Loading the data warehouse with data from different operational systems is complicated
Data warehouse gets complicated for the user to understand and to use


Which of the following is true about 'Dice' operation?
The Dice operation performs selection of two or more dimension on a given cube
The Dice operation performs selection of one dimension on a given cube
It navigates the data from less detailed data to highly detailed data
It forms a new sub-cube by selecting one or more dimensions


Which of the following is true about multidimensional models?
It uses just one level of data storage representation to handle sparse data sets
It uses just one level of data storage representation to handle dense data sets
It uses two level of data storage representation to handle dense and sparse data sets
It uses one level of data storage representation to handle both dense and sparse data sets


Which operation on a multidimensional database is most equivalent to a relational join?
dice
slice
rotating
drill-across


Choose the most correct statement:
Fact Constellation schema contains denormalized Dimension table
Star schema contains normalized Dimension table
Snowflake schema contains denormalized Dimension table
Snowflake schema contains normalized Dimension table


In which of the data storing approaches does transaction data get partitioned into facts?
dimensional approach
Both dimensional and normalized approach
normalized approach
Neither dimensional nor normalized approach


What Amazon Reshift compression encodings would be most appropriate for the columns (Date, City, Description), respectively?
Delta, Byte-dictionary, LZO
MOSTLY8, Runlength, Raw
LZO, Delta, Raw
Byte-dictionary, LZO, Delta


Which of the following is not a type of OLAP?
Multidimensional
Analytical
Hybrid
Relational


Which of the following is a property of Data Warehouse data?
Write Deconditional
Write Only
Read Only
Read and Write


In a 3-tier architecture of the data warehouse, the OLAP engine is related to:
The fourth layer of the architecture
The Second layer of the architecture
The third layer of the architecture
The first layer of the architecture


In order to populate the data warehouse, which of the following set of operations are appropriate?
Query and update
Create and edit
Refresh and load
Insert and delete


Which of following stages involve data update in the warehouse for every transaction performed on the source data?
On time data warehouse
Offline data warehouse
Offline operational data warehouse
Integrated data warehouse


Which of the following operations performs aggregation on a data cube by dimension reduction?
Drill-down
Roll-up
Drill-up
Roll-down


Which of the following applies to Fact Tables?
Partially normalized
Completely normalized
Completely denormalized
Partially denormalized


‘ROLAP’ is preferred over ‘MOLAP’ in which of the following cases?
A data warehouse and relational database are separable
When fast data retrieval is required
When slicing and dicing is required
When there is a requirement to perform complex calculation


What is the principal disadvantage of Type 3 Slowly Changing Data management?
Updates obsolete data with new values infrequently
Does not track historical data
Stores only a finite number of historical changes
Significantly increases storage requirements per record


An ETL Joiner Transformation is performed on Source A with 10,000 rows and Source B with 25,000 rows. Which source should be the master?
Source B to prevent rows from being omitted
Source A to speed up the join process
None of the Above
There is no master-detail distinction, Joiner Transformations gives both sources parity


Which of the following is the most appropriate data structure for indexing a MOLAP Cube?
Index table
Bitmap index
Huffman encoding
B-Tree


Which of the following is not a property of a perspective?
Can securely restrict access to objects in a cube
Can hide any number of dimensions from a cube
Can display hierarchies from a cube
Can provide business-specific viewpoints of a cube


Among the following, which is NOT the objective of clustering?
To initiate hypothesis about the data
To find consistent and valid organizations of the data
It facilitates OLAP operations
To cover natural groupings


Which of the following is not an advisable implementation of division in MDX?
measures.[Value A]/measures.[Value B]
None of the Above
iff(measures.[Value A]=0, null, measures.[Value A]/measures.[Value B])
Divide(measures.[Value A]/measures.[Value B], null)


When is a column-based engine a more appropriate architecture for a Data Warehouse than a row-based engine?
small number of known indexes to be created, static query patterns
a column-based engine is only appropriate for OLTP applications
most columns in a table will be required for a query
dynamic, unpredictable query patterns


Which of the following is not TRUE about subject oriented characteristics of a data warehouse?
Data warehouse has a defined scope
Data should not change once it is entered into the warehouse
Data is manipulated to provide information about a particular subjec
Data that are never deleted once they have been added


Which of the following is an appropriate method for mitigating the database explosion associated with sparse dimensions?
populate the sparse dimension with filler values
apply Type 6 updating to the sparse dimension
create a composite to stand-in for the sparse dimension
isolate the sparse dimension in a separate cube


What are the following is not a requirement of First Normal Form?
Separate tables for each group of related data with a unique identifier for each row
None of the Above
No duplicate columns within a single table
No columns that are not dependent on a primary key


Which of the following is not a valid context for the MDX From clause?
a perspective (subset of measures and dimensions
a subcube (nested MDX query)
None of the Above
a cube


Which of the following environments has more indexes?
OLAP
Data Warehousing env
OLTP
Database


What is true of querying normalized data in a relational database for the purposes of business analysis?
Aggregating relevant data requires the creation of new primary keys
The data must first be placed into a data warehouse via the ETL process
Aggregating relevant data requires multiple table joins
All of the Above


What is a slice in the context of Amazon Redshift?
A logical partition of the data layer shared between distributed compute nodes
A parallel operation isolating a single layer of a Redshift cube
A client-facing partition that isolates different parts of the data warehouse for different types of users in an organization
A partition allowing each of core of a multi-processor node to process the node's workload in parallel


Choose the most correct statement:
OLAP is more flexible then OLTP
Data mining and data warehousing are totally unrelated
Data mining and data warehousing are the same
Data warehousing is the application of data mining


Without deliberate security measures, which of the following MDX clauses is susceptible to injection attack?
WHERE
All of the Above
WITH
FROM


Which of the following is a required property of information passed to the Microsoft Sequence Clustering Algorithm?
normalized data with key sequence columns on each dimension table
sequence information stored as a nested table
star schema with key sequence columns on each dimension table
sequence information stored as a flat table


What property do Router Transformations and Filter Transformations have in common?
All of the Above
single input group transformation
single output group transformation
will not block input rows


Where is the OLAP page pool stored in memory in an Oracle Database 10g and later?
Private SQL Area
SQL Work Area
User Global Area
Program Global Area


Online extraction process falls under which type of data extraction method?
Incremental Extraction
Physical Extraction
Logical Extraction
Full Extraction


A data warehouse is an ‘integrated’ collection of data because?
It is a collection of data of derived from multiple sources
It contains summarized data
It is a collection of data of different types
It is a relational database


All else being equal, which of the following elements of a single MDX WHERE clause will evaluate first?
MEMBER [Measures].[C Less One] AS [Measures].[Measure C] - 1, SOLVE_ORDER = 1
All elements evaluate simultaneously
MEMBER [Measures].[B Twice] AS [Measures].[Measure B] + [Measures].[Measure B], SOLVE_ORDER = 4
MEMBER [Measures].[A Squared] AS [Measures].[Measure A] * [Measures].[Measure A], SOLVE_ORDER = 2