Professional Documents
Culture Documents
ACA Bigdata QUIZ
ACA Bigdata QUIZ
ACA Bigdata QUIZ
100%
1.Question 1
The maximum SQL length allowed in MaxCompute varies with the environment resources that are
applied.
MaxCompute SQL is suitable for massive data (GB, TB, EB level), off-line batch calculation scenarios.
2.Question 2
Currently, MaxCompute supports specifying up to 6 small tables in a mapjoin, otherwise syntax errors
are reported, and records for a single small table are limited to no more than 10,000.
True
False
3.Question 3
True
False
4.Question 4
When you have multiple tables join, it only allows one leftmost table to be a mapjoin table.
True
False
5.Question 5
Data type inconsistency is often encountered in business data processing. In order to keep data type
consistent, data processing system involves data type conversion. If MaxComputeSQL is used for data
processing, which of the following conversion can not be achieved?
Bigint to String
String to Boolean
String to Bigint
Datetime to String
6.Question 6
FALSE or TRUE=TRUE
TRUE or NULL=TRUE
7.Question 7
Suppose table t_dml only has one field named as id (type is string), which MaxCompute SQL query
cannot be executed correctly?
Correct
8.Question 8
Which UNION ALL statements in MaxCompute SQL are correct? (Number of Correct Answers: 3)
Combines two or multiple data sets returned by a SELECT operation into one data set. If the result
contains duplicated rows, all rows that meet the conditions are returned, and deduplication of
duplicated rows is not applied.
MaxCompute does not support union two main query results, but you can do it on two subquery results.
The columns of each sub query corresponding to the union all operation must be listed, not supporting
*.
The number, names, and types of queried columns corresponding to the UNION ALL/UNION operation
must be consistent.
9.Question 9
10.Question 10
Which of the following statements of mapjoin Hint application of MaxCompute SQL are correct ?
(Number of Correct Answers: 3)
When a large table joins one or multiple small tables, you can use MapJoin, which performs much faster
than regular Joins
When Mapjoin references to a small table or sub query, alias should be referenced.
MaxCompute SQL does not support the use of complex Join conditions such as unequal expressions, or
logic in normal Join's on conditions, but in MapJoin it can
When do multiple tables Join, the two left tables can be MapJoin tables at the same time.
1.Question 1
True
False
2.Question 2
Java UDF supports Bigint, String, Double, Boolean , ARRAY, MAP, and STRUT
True
False
3.Question 3
UDF output a return value at a time. UDTF can output more than two records at one time.
True
False
4.Question 4
For UDAF(User Defined Aggregation Function), input and output is many-for-one relationship.
True
False
5.Question 5
The NULL value in SQL is represented by a NULL reference in Java; therefore, ‘Java primitive type’ is not
allowed because it cannot represent a NULL value in SQL.
True
False
6.Question 6
UDF
UDAF
UDGF
UDTF
7.Question 7
Which correspondence is incorrect between MaxCompute data type and Java data type?
Need to specify the running data source when run UDF locally.
UDF/UDAF/UDTF typically works on some columns of the table in the SELECT clause, and needs to
configure MaxCompute project, table, and column when run local test.
Warehouse is built locally to store tables (including meta and data) or resources for executing UDF
locally. The project name, tables, table name and sample data are under the warehouse directory in
order.
9.Question 9
Which of the following ways of UDTF usage in SQL are correct? (Number of Correct Answers: 3)
select user_udtf(col0, col1, col2) as (c0, c1) from (select * from my_table distribute by key sort by key) t;
select reduce_udtf(col0, col1, col2) as (c0, c1) from (select col0, col1, col2 from (select map_udtf(a0, a1,
a2, a3) as (col0, col1, col2) from my_table) t1 distribute by col0 sort by col0, col1) t2;
10.Question 10
Which of the following UDTF statements are correct? (Number of Correct Answers: 2)
It does not support the use with group by together in the same SELECT clause.
It supports the use with distribute by together in the same SELECT clause.
It does not support the use with sort by together in the same SELECT clause.
Data Visualization Quiz
TOTAL POINTS 10
1.Question 1
As an important Platform as a service (PaaS) product in Alibaba Cloud product portfolios. Alibaba Cloud
DataWorks offers its users a one-stop solution, which of the following capability is not included in the
solution?
Data Integration
Data Management
Data Governance
2.Question 2
If today's date is 2019-03-11, what is the result if the Partition Expression is set to dt=$[yyyymmdd-1]
when creating rule configuration in Data Quality?
dt=20190310
dt=20190311
dt=20190312
dt=20190311-1
3.Question 3
Function Studio allows you to edit MaxCompute Java user-defined functions (UDFs) and to compile and
publish them to DataWorks with one click.
True
False
4.Question 4
Which of the following notification method is not supported by the Data Quality?
DingTalk
Email
WeChat
SMS
5.Question 5
Which of the following tasks is supported for being associated with Data Quality?
Scheduling Tasks
6.Question 6
What alarm levels are supported by DataWorks Data Quality Control component? (Number of correct
answers: 2)
7.Question 7
Function Studio in DataWorks supports UDF(user define function), ______ and ______
templates.
MapReduce function
Flink function
8.Question 8
The process of using data quality is to configure monitoring rules for existing tables. After you configure
a rule, what can be done to verify the rule?
Run a Trial
Test
Run A/B Test
9.Question 9
DataService Studio works together with _________ to provide a secure, stable, low-cost and easy-to-use
data sharing service.
10.Question 10
Which of the following products is one of the underlying computing engine layer of DataWorks?
MaxCompute
OSS
Polar DB
Quick BI Quiz
TOTAL POINTS 10
1.Question 1
Delete
Edit
Copy
Move
2.Question 2
Which local file type is not supported when using local files as the data source of QuickBI ?
CSV
XLS
XLSX
TXT
3.Question 3
The exploration space is a dedicated storage area of Quick BI. It supports txt, CSC, Excel, DataWorks.
True
False
4.Question 4
True
False
5.Question 5
In QuickBI, when use a local Excel file contains multiple sheets as the data source, all sheets can be
uploaded at once.
True
False
6.Question 6
Which chart is suitable for comparing the sales situation of a commodity in various regions?
Gauge
Bar Chart
Card Chart
Scatter Chart
7.Question 7
Color Legend
X axis
Y axis
8.Question 8
Which of the following statements are correct for dashboard in QuickBI? (Number of Correct Answers:
3)
The dashboard supports two modes:Standard dashboard & Full -Screen mode .
You can simply edit the dataset accordingly to meet the actual dashboard demands.
9.Question 9
10.Question 10
Making reports
Business exploration
Self-help data acquisition
Data synchronization
1.Question 1
Machine Learning Platform For AI provides end-to-end machine learning services, including data
processing, feature engineering, model training, model prediction, and model evaluation. Machi ne
Learning Platform For AI combines all of these services to make AI more accessible than ever.
True
False
2.Question 2
True
False
3.Question 3
The Read MaxCompute Table component is unaware of any modifications (such as add or remove a
column) made to a table that is already loaded to the component.
True
False
4.Question 4
Feature engineering includes feature derivation and scale change. The heart disease prediction project
uses the feature selection and data normalization components for feature engineering.
True
False
5.Question 5
The Read MaxCompute Table component enables you to read data from MaxCompute tables. To read a
table from another project that you are authorized to access, you can use the format of _________.
6.Question 6
The total service fee is equal to the billing fee of the component you use multiplied by the number of
computing hours. The computing hours are measured by using the formula ______
7.Question 7
A SQL Script Component supports a maximum of _____ input port(s) and one output port.
8.Question 8
XML
JSON
CSV
XLS
9.Question 9
Sampling data in PAI is generated in the weighted mode. The weight column must be of ____ or ____
type.
Double
Boolean
Int
String
10.Question 10
The processor layer of PAI is the infrastructure layer that consists of ______ and _____ clusters.
CPU
MPI
GPU
MapReduce