
Create a PySpark (or any Python-based distributed) framework that applies rules over incoming Kafka stream data.

Apply the rules defined in the RULE table and, whenever a rule break occurs, create an entry in any persistent storage under the logical table RULE_BREAK.

We have a single data source producing signal data every 1 minute as a Kafka stream, and the 3 (or n) tags below with their data types.

Table : TAG : definition of a sensor tag

--------------------
tag_id -> tag id for a signal
tag_name -> tag name for a signal
data_type -> data type of the tag; restricted to string, int, double

tag_id | tag_name | data_type
1 | t1 | double
2 | t2 | int
3 | t3 | string
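
Since the TAG table is what fixes each tag's data type, the payload schema can be derived from it instead of being hard-coded; this also covers the "n number of tags" case. A minimal PySpark sketch (the helper name and the three-row example call are illustrative, not part of the spec):

from pyspark.sql.types import (StructType, StructField, StringType,
                               IntegerType, DoubleType, LongType)

# Map TAG.data_type strings to Spark types (restricted to string, int, double).
SPARK_TYPES = {"string": StringType(), "int": IntegerType(), "double": DoubleType()}

def build_value_schema(tags):
    # tags: (tag_name, data_type) rows read from the TAG table.
    fields = [StructField("timestamp", LongType())]
    fields += [StructField(name, SPARK_TYPES[dtype]) for name, dtype in tags]
    return StructType(fields)

# The three tags defined above.
value_schema = build_value_schema([("t1", "double"), ("t2", "int"), ("t3", "string")])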

Table : RULE : definition of a rule break expression

--------------------
rule_id -> id of the rule
rule_name -> name of the rule
rule_expression -> expression for the rule to be applied; the only operators we need to support are >, =, <, and !=, as per the data types
rule_break_count -> number of consecutive rule breaks required
rule_description -> human-readable description of the rule

# Note – each rule break definition uses exactly one tag, e.g. "t1 > 4" occurring 4 consecutive times

rule_id | rule_name | rule_expression | rule_break_count | rule_description
1 | t1_double_rule | t1 > 55.43 | 4 | if t1 > 55.43 for 4 consecutive times, it's a rule break
2 | t2_int_rule | t2 > 20 | 6 | if t2 > 20 for 6 consecutive times, it's a rule break
3 | t3_string_rule | t3 = ON | 3 | if t3 = ON for 3 consecutive times, it's a rule break
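
Because every rule_expression is "tag operator literal" over a single tag, it can be compiled once when the RULE table is loaded and then evaluated per record. A plain-Python sketch, assuming whitespace-separated expressions exactly as in the table above (compile_rule is an illustrative helper, not part of the spec):

import operator

# Only the four operators the assignment requires.
OPS = {">": operator.gt, "<": operator.lt, "=": operator.eq, "!=": operator.ne}

def compile_rule(rule_expression, data_type):
    # e.g. ("t1 > 55.43", "double") -> ("t1", predicate over one tag value)
    tag_name, op_symbol, literal = rule_expression.split()
    cast = {"string": str, "int": int, "double": float}[data_type]
    op, threshold = OPS[op_symbol], cast(literal)
    return tag_name, lambda value: op(cast(value), threshold)

# Rule 1 from the table above.
tag, pred = compile_rule("t1 > 55.43", "double")
assert pred(63.23) and not pred(20.23)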

Table : Output : the RULE_BREAK schema should contain the fields below

---------------------------
rule_id -> id of the rule that broke
rule_break_stop_timestamp -> the timestamp at which the rule break count satisfied the criteria; we need to capture the stop time when the rule break condition is satisfied. While streaming, there can be multiple rule breaks for each definition, depending on its criteria.

Example :
A Kafka streaming data sample would look like the records below (shown here as valid, flattened JSON); you can generate your own data for testing:

{ "timestamp": 1571053218000, "t1": 55.23, "t2": 10, "t3": "ON" }
{ "timestamp": 1571053278000, "t1": 63.23, "t2": 11, "t3": "OFF" }
{ "timestamp": 1571053338000, "t1": 73.23, "t2": 12, "t3": "ON" }
{ "timestamp": 1571053398000, "t1": 83.23, "t2": 13, "t3": "ON" }
{ "timestamp": 1571053458000, "t1": 20.23, "t2": 14, "t3": "ON" }
{ "timestamp": 1571053518000, "t1": 30.23, "t2": 25, "t3": "OFF" }
and so on...
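
A possible ingestion sketch with Spark Structured Streaming, assuming the records are flattened to valid JSON as shown above. The broker address and topic name are placeholders, and the job needs the spark-sql-kafka connector package on its classpath:

from pyspark.sql import SparkSession
from pyspark.sql.functions import col, from_json
from pyspark.sql.types import (StructType, StructField, LongType,
                               DoubleType, IntegerType, StringType)

spark = SparkSession.builder.appName("rule-break-detector").getOrCreate()

# Schema matching the sample records above (or built from the TAG table).
value_schema = StructType([
    StructField("timestamp", LongType()),
    StructField("t1", DoubleType()),
    StructField("t2", IntegerType()),
    StructField("t3", StringType()),
])

# Kafka source; bootstrap servers and topic name are illustrative placeholders.
raw = (spark.readStream
       .format("kafka")
       .option("kafka.bootstrap.servers", "localhost:9092")
       .option("subscribe", "sensor-signals")
       .load())

# Each Kafka record's value is a JSON string; parse it into typed columns.
events = (raw.select(from_json(col("value").cast("string"), value_schema).alias("e"))
          .select("e.*"))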

The result would be:

A rule break happened for rules 1 and 3 only, so the output contains:

rule_id | rule_break_stop_timestamp
1 | 1571053398000
3 | 1571053458000

Note:
We create an entry in the RULE_BREAK table only if the condition is satisfied for n consecutive streaming records.
If the condition is not satisfied by the current record, we reset the count and apply the pattern/rule break conditions again.
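
This reset rule amounts to a small per-rule state machine: count consecutive hits, emit the current record's timestamp once the count reaches rule_break_count, and zero the counter whenever the expression fails. A plain-Python sketch of that state; in the streaming job it would live in Spark's stateful APIs (e.g. applyInPandasWithState in Spark 3.4+) or in an external store keyed by rule_id:

class ConsecutiveBreakTracker:
    # Tracks consecutive hits for one rule and returns a RULE_BREAK row
    # (rule_id, rule_break_stop_timestamp) each time the criteria is met.

    def __init__(self, rule_id, predicate, rule_break_count):
        self.rule_id = rule_id
        self.predicate = predicate
        self.rule_break_count = rule_break_count
        self.streak = 0

    def update(self, value, timestamp):
        if self.predicate(value):
            self.streak += 1
            if self.streak == self.rule_break_count:
                self.streak = 0  # completed streak: start counting afresh
                return (self.rule_id, timestamp)
        else:
            self.streak = 0  # condition failed in the current record: reset
        return None

Replaying the sample records for rule 3 (t3 = ON, count 3) yields (3, 1571053458000), matching the expected output above. Note that after a completed streak this sketch restarts the count from zero rather than emitting on every further consecutive hit; that behaviour is an assumption worth confirming against the spec.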

▪ Briefly describe the conceptual approach you chose! What are the trade-offs?
▪ What's the runtime performance? What is the complexity? Where are the bottlenecks?
▪ If you had more time, what improvements would you make, and in what order of priority?
