Kaggle Problems 1
Microsoft Malware Prediction
Can you help protect more than one billion machines from damage BEFORE it happens?
With more than one billion enterprise and consumer customers, Microsoft takes the threat of malware very
seriously and is deeply invested in improving security.
As one part of their overall strategy for doing so, Microsoft is challenging the data science
community to develop techniques to predict if a machine will soon be hit with malware. As with
their previous Malware Challenge (2015), Microsoft is providing Kagglers with an unprecedented
malware dataset to encourage open-source progress on effective techniques for predicting
malware occurrences.
Can you help protect more than one billion machines from damage BEFORE it happens?
Acknowledgements
This competition is hosted by Microsoft, Windows Defender ATP Research, Northeastern University
College of Computer and Information Science, and Georgia Tech Institute for Information Security &
Privacy.
Dataset Description
The goal of this competition is to predict a Windows machine’s probability of getting infected by various
families of malware, based on different properties of that machine. The telemetry data containing these
properties and the machine infections was generated by combining heartbeat and threat reports collected by
Microsoft's endpoint protection solution, Windows Defender.
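The task described above can be read as binary classification with a probability output. The following is a minimal sketch under stated assumptions, not the competition's method: the two feature names (av_enabled, os_outdated), the synthetic data, and the rule that generates the labels are all invented for illustration and do not come from the dataset. It fits a small logistic regression by gradient descent using only the Python standard library.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(weights, x):
    # weights[0] is the bias; the rest pair up with the features in x.
    z = weights[0] + sum(w * xi for w, xi in zip(weights[1:], x))
    return sigmoid(z)


def fit_logistic(rows, labels, lr=0.5, epochs=300):
    # Batch gradient descent on the mean log loss.
    weights = [0.0] * (len(rows[0]) + 1)
    for _ in range(epochs):
        grad = [0.0] * len(weights)
        for x, y in zip(rows, labels):
            err = predict_proba(weights, x) - y
            grad[0] += err
            for j, xj in enumerate(x):
                grad[j + 1] += err * xj
        weights = [w - lr * g / len(rows) for w, g in zip(weights, grad)]
    return weights


# Synthetic machines with two hypothetical binary properties.
random.seed(0)
rows, labels = [], []
for _ in range(200):
    av_enabled = random.randint(0, 1)
    os_outdated = random.randint(0, 1)
    # Invented infection rule, used only to produce toy labels.
    risk = 0.2 + 0.5 * os_outdated - 0.15 * av_enabled
    rows.append([av_enabled, os_outdated])
    labels.append(1 if random.random() < risk else 0)

weights = fit_logistic(rows, labels)
p_protected = predict_proba(weights, [1, 0])  # antivirus on, OS up to date
p_exposed = predict_proba(weights, [0, 1])    # antivirus off, OS outdated
print(f"protected: {p_protected:.3f}  exposed: {p_exposed:.3f}")
```

On real telemetry the same shape applies: each row is one machine, each column one property, and the model returns a value between 0 and 1 per machine. A practical entry would likely need categorical encoding and a stronger model than this sketch.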
Columns
Columns whose descriptions are unavailable, or whose names are self-documenting, are marked with "NA".