
10)Select the correct differences between XML attributes and elements.

➔Elements can contain tree structure

Elements can have multiple values

11) Which of the following statements correctly defines the characteristics of the Kimball model?

➔ In the Kimball model the analytical systems can access the data directly

Kimball model uses the dimensional model

12)Specify some of the essential logical components of a data warehouse.

➔Entities Attributes

13) If neither ASC nor DESC is specified after a SQL ORDER BY clause, which of the following is used by default?

-ASC
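
The default ordering can be checked with a small sketch. It uses Python's built-in sqlite3 module and a hypothetical in-memory table named customer (neither the table nor the rows come from the question) and only illustrates that omitting ASC/DESC gives the same result as writing ASC.

```python
import sqlite3

# Hypothetical in-memory table used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (name TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?)",
    [("Carol",), ("Alice",), ("Bob",)],
)

# ORDER BY without a direction keyword ...
default_order = [r[0] for r in conn.execute(
    "SELECT name FROM customer ORDER BY name")]
# ... compared with an explicit ASC.
explicit_asc = [r[0] for r in conn.execute(
    "SELECT name FROM customer ORDER BY name ASC")]
conn.close()

print(default_order == explicit_asc)
```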

14) The SQL keyword(s) ______ is used with wildcards.

a) LIKE only b) IN only c) NOT IN only d) IN and NOT IN

15)Find the SQL statement below that is equal to the following: SELECT NAME FROM CUSTOMER WHERE
STATE = 'VA';

SELECT NAME IN CUSTOMER WHERE STATE IN ('VA');

SELECT NAME IN CUSTOMER WHERE STATE = 'VA';

SELECT NAME IN CUSTOMER WHERE STATE = 'V';

SELECT NAME FROM CUSTOMER WHERE STATE IN ('VA');

Answer: Option 4: SELECT NAME FROM CUSTOMER WHERE STATE IN ('VA');

Concept: The given SQL statement is SELECT NAME FROM CUSTOMER WHERE STATE = 'VA'; it finds the
names of the customers whose state is VA.

IN operator: The SQL IN condition (sometimes called the IN operator) tests whether an expression
matches any value in a list of values. It reduces the need for multiple OR conditions in a SELECT,
INSERT, UPDATE or DELETE statement.

SQL: SELECT NAME FROM CUSTOMER WHERE STATE IN ('VA'); This query also finds the names of the
customers whose state is 'VA', using the IN operator. Here the list has only one value, so the
condition is equivalent to STATE = 'VA'.

Hence the correct answer is SELECT NAME FROM CUSTOMER WHERE STATE IN ('VA');
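
The equivalence can be tried out with a minimal sketch. It assumes Python's built-in sqlite3 module and made-up sample rows; only the CUSTOMER table name and the NAME and STATE columns come from the question.

```python
import sqlite3

# In-memory database with made-up rows (the rows are assumptions).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE CUSTOMER (NAME TEXT, STATE TEXT)")
conn.executemany(
    "INSERT INTO CUSTOMER VALUES (?, ?)",
    [("Ann", "VA"), ("Ben", "MD"), ("Cy", "VA")],
)

# Equality test against a single value.
with_equals = conn.execute(
    "SELECT NAME FROM CUSTOMER WHERE STATE = 'VA' ORDER BY NAME").fetchall()
# IN with a one-element list.
with_in = conn.execute(
    "SELECT NAME FROM CUSTOMER WHERE STATE IN ('VA') ORDER BY NAME").fetchall()
conn.close()

print(with_equals == with_in)
```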

16) Which of the following is not a valid NumPy function?

Valid NumPy names include:

numpy.linspace
numpy.digitize
numpy.repeat
numpy.random
numpy.polyfit
numpy.polyval
numpy.nan
numpy.histogram
numpy.squeeze
numpy.argmax

Ans: numpy.pop is not valid

17) Which of the following is a valid syntax for creating a cursor?

All of the above

18) Output of the code below

s = "Accnture"

print(s[5])
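
A short sketch of zero-based string indexing, assuming the string is exactly as written in the question ("Accnture", eight characters). If the intended string was spelled differently, the character at index 5 would differ.

```python
s = "Accnture"

# Python strings are indexed from 0:
# A=0, c=1, c=2, n=3, t=4, u=5, r=6, e=7
ch = s[5]
print(ch)  # u
```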

19) Which of the following are valid data types that can be stored in a data lake?

Ans: Unstructured data, Structured data, Semi-structured data


Data Lakes allow you to store relational data like operational databases and data from
line of business applications, and non-relational data like mobile apps, IoT devices,
and social media. They also give you the ability to understand what data is in the lake
through crawling, cataloging, and indexing of data.
A data warehouse typically holds highly structured data, while a data lake can
hold structured, semistructured and unstructured data.

20) Which of the following is not a characteristic of a data lake?

Ans: Data is not searchable easily

21) What could be accomplished by using big data technologies?

➔Cost reductions

New product development and optimized offerings


22) Which of the following options can be used to do pattern based search?

a) cut

b) head

c) tail

d) grep

23) In how many different forms/structures BigData could be found?

a) 2

b) 3

c) 4

d) 5

24) file1.txt should not be readable by others but should be readable and writable by the owner and
group members. Which command achieves this?

chmod 660 file1.txt
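
To see what mode 660 means, here is a minimal sketch that applies the same octal mode from Python. It assumes a POSIX system and uses a temporary file rather than file1.txt; the variable names are illustrative.

```python
import os
import stat
import tempfile

# Temporary file standing in for file1.txt (an assumption for the demo).
fd, path = tempfile.mkstemp()
os.close(fd)

# 6 = read + write for owner, 6 = read + write for group, 0 = nothing for others.
os.chmod(path, 0o660)

mode = stat.S_IMODE(os.stat(path).st_mode)
perm_string = stat.filemode(os.stat(path).st_mode)
os.remove(path)

print(oct(mode), perm_string)
```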

25) When implementing a data governance practice in your organization, which one factor will NOT
play a major role in the implementation?

a) Data quality

b) Domain integration

c) Reusability

d) Business value

26) The TestDir directory has 3 files stored in it. Which of the following options can be used to delete
TestDir?

rm -r TestDir
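
The -r flag is needed because a non-empty directory cannot be removed by a plain directory delete. A rough Python analogue, assuming a throwaway TestDir created under a temporary directory: os.rmdir fails on the non-empty directory, while shutil.rmtree removes it recursively, much as rm -r does.

```python
import os
import shutil
import tempfile

# Build a throwaway TestDir containing 3 files (paths are assumptions).
base = tempfile.mkdtemp()
test_dir = os.path.join(base, "TestDir")
os.mkdir(test_dir)
for name in ("a.txt", "b.txt", "c.txt"):
    with open(os.path.join(test_dir, name), "w") as f:
        f.write("x")

# Removing a non-empty directory non-recursively fails.
try:
    os.rmdir(test_dir)
    rmdir_failed = False
except OSError:
    rmdir_failed = True

# Recursive removal deletes the directory and its contents.
shutil.rmtree(test_dir)
removed = not os.path.exists(test_dir)
os.rmdir(base)

print(rmdir_failed, removed)
```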

27) Select the tool that allows specifying the relation of multiple tables as data sources when
reading data from a database as input in Talend studio.

a) Database Custom Schema

b) SQL Mapper

c) TableJoin

d) SQL Builder
28)WHICH IS NOT A DATA QUALITY AIMED PROJECT INITIATIVE?

ANS: DATA INTEGRATION

29) Select the correct differences between XML attributes and elements.

a) Elements can have multiple values

b) Elements can contain tree structure

c) Attributes can have multiple values

d) Attributes can contain tree structure

30) The benefits of a standard relational language include which of the following?

a) Reduced training costs

b) Increased dependence on a single vendor

c) Applications are not needed

31) To remove duplicate rows from the results of an SQL SELECT statement, the ______ qualifier
must be included.

a) ONLY

b) SINGLE

c) DISTINCT

d) UNIQUE
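
A small sketch of duplicate removal with the DISTINCT qualifier, assuming Python's built-in sqlite3 module and a hypothetical customer table with repeated state values.

```python
import sqlite3

# Hypothetical table with repeated values.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customer (state TEXT)")
conn.executemany(
    "INSERT INTO customer VALUES (?)",
    [("VA",), ("MD",), ("VA",), ("VA",)],
)

# Without DISTINCT every row is returned, duplicates included.
all_states = [r[0] for r in conn.execute(
    "SELECT state FROM customer ORDER BY state")]
# With DISTINCT each value appears once.
distinct_states = [r[0] for r in conn.execute(
    "SELECT DISTINCT state FROM customer ORDER BY state")]
conn.close()

print(all_states, distinct_states)
```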

32) Which of the following is not a valid Pandas function?

a) dataframe.drop()

b) dataframe.dropna()

c) dataframe.insert()

d) dataframe.fillna()

33) When data is saved in Azure Data Lake, how many copies of data are saved by default?

a) 1

b) 2

c) 3

d) 4
34) Python code can connect to which databases

a) Oracle

b) mysql

c) Sqlite

d) All of the above

*35) Choose 3 appropriate options which are true about String literals

a) String literals point to the same object in the string constant pool

b) While comparing literals, == and equals() will both return true

c) String literals are stored in Heap

d) String literals are used to represent a non sequence of characters

*36) Choose 4 appropriate options which are true about Global Distribution in Cosmos DB.

a) To achieve low latency and high availability, instances of these applications need to be deployed
in datacenters that are close to their users

b) enables writes across all regions with automatic failover

c) Dynamically adding and removing regions is not allowed

d) Azure Cosmos DB is a globally distributed database system that allows us to read and write data from
the local replicas of our database

e) Azure Cosmos DB transparently replicates the data to all the regions associated with our Cosmos
account
37)

38)
39) which of the following is not a core data type in python language?

CLASS

40) Which of the following is not a DDL command?

a) TRUNCATE

b) UPDATE

C) ALTER

D) CREATE

41) Which of the following are valid data types that can be stored in a data lake?

a) Structured data b) Un Structured data c) Semi-structured data

42) If neither ASC nor DESC is specified, ASC is used by default.

43) What will be the best approach of data management for a multi-faceted enterprise with multiple
domains?

a) Top-down approach b) Bottom-up approach c) Middle out approach

d) A combination of top-down and middle out approach

44) To change Thomas into Michel in the LastName column:


UPDATE Users SET LastName = 'Michel' WHERE LastName = 'Thomas';
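
The statement can be exercised with a minimal sketch using Python's built-in sqlite3 module. The Users table layout and the sample rows are assumptions; only the table name, the LastName column and the two values come from the question.

```python
import sqlite3

# Assumed table layout and rows, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Users (FirstName TEXT, LastName TEXT)")
conn.executemany(
    "INSERT INTO Users VALUES (?, ?)",
    [("Ann", "Thomas"), ("Bob", "Lee")],
)

# Only rows matching the WHERE condition are changed.
cur = conn.execute(
    "UPDATE Users SET LastName = 'Michel' WHERE LastName = 'Thomas'")
updated = cur.rowcount

last_names = [r[0] for r in conn.execute(
    "SELECT LastName FROM Users ORDER BY FirstName")]
conn.close()

print(updated, last_names)
```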

45)Essential features of Strategic Information

Preserves Data Integrity and Time Variant

46) Which of the following syntaxes is valid for adding a column to the dataframe?

df['Contact'] = pd.Series…

47) Assuming the command import numpy as np, create an array with size as 5 rows and 3 columns
and initialize all array elements to zero Choose most appropriate command

np.zeros((5,3))

48) Which of the following is correct option in case of incoming queries, when a warehouse does not
have enough resources available to process the queries?

Snowflake queues the queries until resources become available (it does not resize the warehouse automatically)

49) When scaling up a Snowflake warehouse, what is the scaling factor when moving between T-shirt
sizes?
2
*50) Choose 4 appropriate features of Cosmos DB Gremlin API from the given options

Elastically scalable throughput and storage

Fully managed Graph Database

Manual indexing

Compatibility with Apache TinkerPop

The Gremlin API in Azure Cosmos DB uses the Gremlin query language

51) Choose an appropriate option which is valid about availability zone in Azure

One or more data centers equipped with independent power, cooling, and networking

A collection of software that can enable high scalability at short notice.

A set of data centers close together.

One or more data centers equipped together to provide backup

*52) ABC Pvt Ltd plans to use an Azure Cosmos SQL DB for data storage. The company plans to
use partitioning to meet application performance goals. We need to verify Cosmos DB partitioning
features to ensure that they will meet the company's requirements. Select all that apply.

The correct options that apply are:

Partitioning is automatic and managed transparently by Azure Cosmos DB


A single logical partition can contain no more than 20 GB of data
Microsoft guarantees a minimum of 1000 request units per second (RUs)

53) The HAVING clause does which one of the following?
Acts like a WHERE clause but is used for groups rather than rows
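
A brief sketch of HAVING filtering groups rather than rows, assuming Python's built-in sqlite3 module and a hypothetical orders table. The query keeps only the groups whose row count exceeds 1.

```python
import sqlite3

# Hypothetical table: one row per order, tagged with a state.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (state TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?)",
    [("VA",), ("VA",), ("MD",), ("TX",), ("TX",), ("TX",)],
)

# GROUP BY forms one group per state; HAVING then filters whole groups.
groups = conn.execute(
    "SELECT state, COUNT(*) FROM orders "
    "GROUP BY state HAVING COUNT(*) > 1 ORDER BY state"
).fetchall()
conn.close()

print(groups)
```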
54) Which statement is true regarding routines and triggers?
Both are stored in the databases
55) What do we use to define a block of code in Python language?

Key

Brackets

Indentation

56) what is meant by reference data?


Data that is used for classification of other data
57) Which of the following is a valid syntax for creating cursor?

connection.getcursor

connection.cursor

sqllite.cursor()

The following is a valid syntax for creating a cursor in Python for SQLite:

connection.cursor()
In this syntax, connection is a variable that holds the connection object to the SQLite database. The
cursor() method is called on the connection object to create a new cursor object that can be used to
execute SQL statements and fetch results from the database.
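
A minimal sketch of that pattern with Python's built-in sqlite3 module. The in-memory database and the trivial query are assumptions chosen to keep the example self-contained.

```python
import sqlite3

# connection holds the connection object to a (temporary, in-memory) SQLite database.
connection = sqlite3.connect(":memory:")

# cursor() is called on the connection object to create a cursor.
cursor = connection.cursor()

# The cursor executes SQL statements and fetches results.
cursor.execute("SELECT 1 + 1")
result = cursor.fetchone()[0]
connection.close()

print(result)
```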
58) WHICH IS NOT A PRIMARY RULE FOR DATA VALIDATION?
ANS: GOVERNANCE PRACTICES

59) Which of the following commands can be used to create a soft link to an existing file?
Soft links are created with the ln command and its -s option. For example, the following
creates a soft link named link1 to a file named file1, both in the
current directory:
$ ln -s file1 link1

The -s option makes the link a soft (symbolic) link. The -f option forces the command to overwrite a
file that already exists. The source is the file or directory being linked to.
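
For comparison, a rough Python analogue of ln -s, assuming a POSIX system and a temporary working directory. The file1 and link1 names mirror the example above; everything else is illustrative.

```python
import os
import tempfile

# Temporary working directory so the sketch leaves nothing behind.
workdir = tempfile.mkdtemp()
file1 = os.path.join(workdir, "file1")
link1 = os.path.join(workdir, "link1")

with open(file1, "w") as f:
    f.write("hello")

# Equivalent in spirit to: ln -s file1 link1
os.symlink(file1, link1)

is_link = os.path.islink(link1)   # True for a symbolic link
target = os.readlink(link1)       # path the link points to
with open(link1) as f:            # reading through the link reaches file1
    content = f.read()

os.remove(link1)
os.remove(file1)
os.rmdir(workdir)

print(is_link, content)
```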
