03 134202 109 11507886537 02012024 053617pm

Group: Muhammad Danish Shahid (03-134202-048)
Waqas (03-134202-090)
Labeeb Mirza (03-134202-109)
EEM PROJECT ASSIGNEMENT
Deliverable 1 of project
Each group has to choose only single one data-set link to download specific data-set and related
papers from given link to improve or reduce gaps in problem of the work as an artificial
intelligence lab final projects.
Task 1
Student has to give project into 4 deliverable as in deliverable 1 student has to download “20”
related research articles and ready a related work or evidence work enlisting details about already
used techniques accuracy and classification results etc..
Article Title Data-set Objective Techniques Results Contribution Gaps

Name Used
"A Neural RankClick Develop a Neural CTR Introduce a Scalability

Attention Model dataset neural attention Networks improveme novel challenges
for Learning to model for with nt: 15% attention- with large ad
Rank" improved Attention based ranking inventory
ranking in Mechanisms model
online
advertising
"Learning to eCommRan Adapt learning Gradient- Conversion Tailor Address

Rank for E- k dataset to rank for e- Boosted rate learning to cold-start
commerce commerce Decision improveme rank to e- problem for
Product product Trees nt: 20% commerce new
Recommendatio recommendatio settings products
ns" ns
"Deep RecDeepRL Explore deep Deep MRR Present a new Limited user
Reinforcement dataset reinforcement Reinforceme improveme approach to engagement
Learning for learning nt Learning nt: 0.1 learning to analysis
Learning to techniques for (DDPG) rank with
Rank in personalized reinforcement
Recommendatio recommendatio
Waqas (03-134202-090)
Labeeb Mirza (03-134202-109)
ns" ns learning
"Learning to ClickRank Investigate Click Models NDCG@5 Combine Need for

Rank with Click dataset learning to (Position- improveme learning to more
Models" rank techniques Based nt: 0.12 rank and click complex
integrated with Model) models for click models
click models better ranking
accuracy
"Unbiased OnlineRank Develop Unbiased CTR Address bias Handling

Learning to dataset unbiased Learning to improveme in online data sparsity
Rank in Online learning to Rank nt: 10% ranking due to challenges
Systems" rank models (ULTR) position and
for online user feedback
systems
"Learning to IR-Rank Improve LambdaMA Mean Demonstrate Exploration

Rank for dataset ranking in RT Average the of handling
Information information Precision effectiveness multilingual
Retrieval" retrieval tasks (MAP): of learning to queries
using learning 0.85 rank in
to rank information
techniques retrieval
"Learning to ImageRank Apply learning Siamese mAP Extend Evaluation

Rank for Image dataset to rank Network improveme learning to on diverse
Retrieval" methods to nt: 0.15 rank to non- image types
image retrieval textual data
tasks for image
retrieval
"Learning to RL- Investigate Deep Q- Click- Showcase the Model

Rank with Recommend reinforcement Network through rate potential of interpretabili
Reinforcement er dataset learning for (DQN) (CTR) reinforcement ty and user
Learning in personalized improveme learning in
Recommender recommendatio nt: 12% recommendati
Waqas (03-134202-090)
Labeeb Mirza (03-134202-109)
Systems" ns on tasks trust
"Learning to QuerySugge Develop a RankNet Precision@ Create a novel Exploration

Rank for Query st dataset query with Query 1: 0.9 approach for of real-time
Suggestion" suggestion Features query suggestions
system using suggestion
learning to
rank techniques
"Neural VisualRank Apply neural Convolutiona mAP Extend Scalability

Learning to dataset networks to l Neural improveme learning to for large
Rank for Visual learning to Networks nt: 0.2 rank to visual visual
Search" rank for visual (CNN) search, databases
search particularly
applications for e-
commerce
Task 2
1. Dataset downloading
2. Data Preprocessing
1 Collection and Organization:
 Collect and organize your dataset. Ensure that it contains labeled examples for
supervised learning tasks.
2 Data Exploration:
 Visualize a few sample images to understand the dataset's characteristics.
 Check for class imbalances or any other issues.
3 Resizing and Standardization:
 Resize images to a common resolution (e.g., 224x224 pixels) if they have
different sizes.
 Standardize pixel values (e.g., scaling to a range between 0 and 1 or -1 and 1).
4 Data Augmentation (Optional):
 Apply data augmentation techniques to increase the dataset's size and diversity.
This includes random rotations, flips, crops, and color adjustments.
5 Splitting the Dataset:
 Split your dataset into training, validation, and test sets. Common splits are 70-
80% for training, 10-15% for validation, and 10-15% for testing.
6 Data Loading:
Waqas (03-134202-090)
Labeeb Mirza (03-134202-109)
 Create data loaders or generators to efficiently load batches of images during
training.
7 One-Hot Encoding (for Classification):
 If you're doing classification, convert class labels into one-hot encoded vectors.
8 Handling Missing Data (if applicable):
 Check for missing images or labels and decide how to handle them (e.g., remove
or impute).
9 Data Normalization (if applicable):
 For some specific tasks, you might need to perform additional data normalization,
like z-score normalization for certain deep learning models.
10 Noise Removal (if applicable):
 Remove any noisy or irrelevant images from the dataset.
11 Data Balancing (if applicable):
 If you have class imbalances, consider techniques like oversampling or under
sampling to balance the dataset.
12 Feature Extraction (if applicable):
 For more advanced tasks, you may perform feature extraction using techniques
like SIFT, HOG, or deep learning-based feature extraction methods.
13 Saving Preprocessed Data:
 Save your preprocessed dataset in a suitable format (e.g., HDF5, TFRecord) for
easy access during training.
14 Data Visualization (Optional):
 Visualize the preprocessed data to ensure it looks correct and makes sense. Plot
sample images from different classes.
15 Documentation:
 Keep clear documentation of the preprocessing steps you've applied, including
any data augmentation techniques, resizing parameters, and normalization
methods used.
These are the links of datasets with available paper content, and you can use Google
Scholar as well as for relevant papers review of these datasets.
1. https://paperswithcode.com/datasets?mod=texts&lang=english&task=common-sense-
reasoning&page=1
2. https://paperswithcode.com/datasets?mod=texts&lang=english&task=mathematical-
reasoning&page=1
3. https://paperswithcode.com/datasets?mod=texts&lang=english&task=automated-theorem-
proving&page=1
4. https://paperswithcode.com/datasets?mod=texts&lang=english&task=low-resource-named-
entity-recognition&page=1
5. https://paperswithcode.com/datasets?mod=texts&lang=english&task=duplicate-question-
retrieval&page=1
6. https://paperswithcode.com/datasets?mod=texts&lang=english&task=new-product-sales-
forecasting&page=1
Waqas (03-134202-090)
Labeeb Mirza (03-134202-109)
7. https://paperswithcode.com/datasets?mod=texts&lang=english&task=low-resource-neural-
machine-translation&page=1
8. https://paperswithcode.com/datasets?mod=texts&lang=english&task=learning-to-rank&page=1
9. https://paperswithcode.com/datasets?mod=texts&lang=english&task=bias-detection&page=1
10. https://paperswithcode.com/datasets?mod=texts&lang=english&task=topic-
classification&page=1
11. https://paperswithcode.com/datasets?mod=texts&lang=english&task=opinion-mining&page=1
12. https://paperswithcode.com/datasets?mod=texts&lang=english&task=formal-logic&page=1
13. https://paperswithcode.com/datasets?mod=texts&lang=english&task=fairness&page=1
14. https://www.kaggle.com/datasets/theworldbank/data-resources-for-structural-economic-
analysis
15. https://www.kaggle.com/datasets/zillow/zecon
16. https://www.kaggle.com/datasets/madhurpant/world-economic-data
17. https://www.kaggle.com/datasets/theworldbank/global-economic-monitor
18. https://www.kaggle.com/datasets/davidwallach/financial-tweets
19. https://www.kaggle.com/datasets/shebrahimi/financial-distress
20. https://www.kaggle.com/datasets/sbhatti/financial-sentiment-analysis
21. https://www.kaggle.com/datasets/ealaxi/paysim1
22. https://www.kaggle.com/datasets/ankurzing/sentiment-analysis-for-financial-news

03 134202 109 11507886537 02012024 053617pm

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

03 134202 109 11507886537 02012024 053617pm

Uploaded by

Copyright:

Available Formats

Group: Muhammad Danish Shahid (03-134202-048)

Article Title Data-set Objective Techniques Results Contribution Gaps

"A Neural RankClick Develop a Neural CTR Introduce a Scalability

"Learning to eCommRan Adapt learning Gradient- Conversion Tailor Address

"Learning to ClickRank Investigate Click Models NDCG@5 Combine Need for

"Unbiased OnlineRank Develop Unbiased CTR Address bias Handling

"Learning to IR-Rank Improve LambdaMA Mean Demonstrate Exploration

"Learning to ImageRank Apply learning Siamese mAP Extend Evaluation

"Learning to RL- Investigate Deep Q- Click- Showcase the Model

Systems" ns on tasks trust

"Learning to QuerySugge Develop a RankNet Precision@ Create a novel Exploration

"Neural VisualRank Apply neural Convolutiona mAP Extend Scalability

You might also like