
Flowchart for Bibliometric Analysis Dataset Preparation using biblioMagika® 2.2 and biblioMagika®Split 1.5
by Aidi Ahmi

1. Getting the Datasets

Identify Topic & Scope

Topic: The topic needs to be searchable, and neither too general nor too specific. You may need to search Google Scholar to check whether any studies have been conducted before.

Scope: The scope includes the inclusion and exclusion criteria. It can be based on the database used, time frame, document types, subject area, etc.

Define the Keywords & Search Query

The keywords and search query must be comprehensive enough to cover the topic. Identify all the synonyms or other words that are being used by other scholars. The search strategy should be properly defined.
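As a rough illustration of combining synonyms into one query, the sketch below joins the synonyms of each concept with OR and the concepts with AND, inside a TITLE-ABS-KEY field. The function name and the example keywords are hypothetical; adapt the terms to your own topic and check the final string in the Scopus search form.

```python
def build_title_abs_key_query(concepts):
    """Combine synonym groups into one TITLE-ABS-KEY style query string.

    Each inner list holds the synonyms for one concept. Synonyms are
    joined with OR, and the concepts are joined with AND.
    """
    groups = []
    for synonyms in concepts:
        # Quote every term so multi-word phrases are searched as phrases.
        quoted = " OR ".join(f'"{term}"' for term in synonyms)
        groups.append(f"({quoted})")
    return f"TITLE-ABS-KEY({' AND '.join(groups)})"


# Hypothetical example topic; replace with your own keywords.
query = build_title_abs_key_query([
    ["bibliometric analysis", "scientometric analysis"],
    ["audit", "auditing"],
])
print(query)
```

Keeping each concept in its own list makes it easy to add a newly discovered synonym without rewriting the whole query.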

Scopus Document Results

Search again using EID
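Searching again by EID can be sketched as building one query from the EIDs of the documents you kept. This assumes the Scopus advanced search accepts the `EID(...)` field code joined with OR; the helper name and the EID values below are placeholders, not real records.

```python
def build_eid_query(eids):
    """Join EIDs into a single OR query, skipping blanks and repeats."""
    seen = []
    for eid in eids:
        eid = eid.strip()
        if eid and eid not in seen:
            seen.append(eid)
    return " OR ".join(f"EID({eid})" for eid in seen)


# Placeholder EIDs for illustration only.
saved = ["2-s2.0-00000000001", "2-s2.0-00000000002", "2-s2.0-00000000001", " "]
eid_query = build_eid_query(saved)
print(eid_query)
```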

If you have any filtering options, i.e. to set up any inclusion and exclusion criteria, you can apply them here.

Screen & Clean the Results

Make sure all the documents are really related to the topic of the study. Remove unrelated documents.

Is the search:
1. using a keyword that has multiple meanings?
2. involving the use of abbreviations?
3. conducted within TITLE-ABS-KEY?

Yes: Save to Saved List

No: Screen & Clean the Results

Make sure all the documents produced by Scopus are related to the topic of the study. Remove unrelated documents. Remove duplicates (if any).
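Duplicate removal can also be checked outside Scopus once a file has been exported. The sketch below keeps the first record for each EID; it assumes the export has an `EID` column, and the sample rows are made up for illustration.

```python
import csv
import io


def drop_duplicates(rows, key="EID"):
    """Keep the first row for each key value, preserving the original order."""
    seen = set()
    unique = []
    for row in rows:
        value = row[key].strip()
        if value not in seen:
            seen.add(value)
            unique.append(row)
    return unique


# Tiny made-up sample standing in for an exported scopus.csv.
sample = io.StringIO(
    "Title,EID\n"
    "Paper A,2-s2.0-1\n"
    "Paper B,2-s2.0-2\n"
    "Paper A,2-s2.0-1\n"
)
rows = list(csv.DictReader(sample))
unique_rows = drop_duplicates(rows)
print(len(rows), "->", len(unique_rows))
```

Matching on EID rather than on title avoids merging two different documents that happen to share a title.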

Download the Dataset

Scopus NEW Version and Scopus OLD Version

Scopus Export Refine Value.csv
All data from this file needs to be copied to biblioMagika®.

scopus.csv (TRUNCATED)
All data from this file needs to be copied to biblioMagika®.

scopus.csv (NEW - NOT TRUNCATED)
This file is just for backup. For the time being, Biblioshiny cannot read the references from this dataset well.

scopus.csv (OLD - NOT TRUNCATED)
The references column needs to be cleaned using OpenRefine. Once this file has been cleaned, it can be combined with the cleaned scopus.csv file from biblioMagika® through OpenRefine.

1 6

Note: You should now have FOUR (4) files. Ensure that you back up all of these files.

biblioMagika® by Aidi Ahmi


2. Getting Started with biblioMagika®

Scopus Export Refine Value.csv
This file is just the SUMMARY of the dataset. It can be used to conduct basic bibliometric analysis. The data from this file needs to be copied to biblioMagika®.

scopus.csv (TRUNCATED)
This file is in csv format and CAN be opened using Excel. The data from this file needs to be copied to biblioMagika®, i.e. to conduct some extended bibliometric analysis and to do the splitting for completing the missing data and harmonising the author, ...

Sort by Cited by column from Largest to Smallest
This can be done either before or after you copy the data to biblioMagika®.

Copy data to biblioMagika®
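The sort itself is normally done in Excel, but the same ordering can be sketched in code. This assumes the export has a `Cited by` column in which uncited documents are left blank; the sample rows are made up.

```python
import csv
import io


def sort_by_citations(rows, column="Cited by"):
    """Sort rows from largest to smallest citation count; blanks count as 0."""
    def citations(row):
        value = (row.get(column) or "").strip()
        return int(value) if value else 0

    return sorted(rows, key=citations, reverse=True)


# Made-up sample; Paper B has an empty "Cited by" cell.
sample = io.StringIO(
    "Title,Cited by\n"
    "Paper A,12\n"
    "Paper B,\n"
    "Paper C,140\n"
)
sorted_rows = sort_by_citations(list(csv.DictReader(sample)))
print([row["Title"] for row in sorted_rows])
```

Converting the counts to integers matters: sorted as text, "140" would come before "12" in ascending order and blanks would be placed unpredictably.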

Data Analysis
The first 5 analyses can be conducted:
1. Basic info.
2. Pub. by Year
3. Pub. by Sources
4. Highly Cited Doc.
5. Authorship Analysis

Data Splitting
Copy Authors and Affiliation Data from



3. Working with biblioMagika®Split

biblioMagika®Split

2 Split Data
Split using the macro SeperatedValue in the Split Sheet. Make sure your Excel has the Developer tab and macros enabled.
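To show what splitting means in practice, here is a minimal sketch that expands one multi-valued cell into one row per value. It is not the macro's actual code; it assumes values are separated by semicolons and that each record carries an `EID`. The author names other than the document's own example are placeholders.

```python
def split_values(cell, separator=";"):
    """Split one multi-valued cell into a list of trimmed, non-empty values."""
    return [part.strip() for part in cell.split(separator) if part.strip()]


def split_rows(records, column):
    """Expand each record into one (EID, value) pair per value in `column`."""
    pairs = []
    for record in records:
        for value in split_values(record[column]):
            pairs.append((record["EID"], value))
    return pairs


# Placeholder records for illustration only.
records = [
    {"EID": "2-s2.0-1", "Authors": "Ahmi, A.; Author, B."},
    {"EID": "2-s2.0-2", "Authors": "Author, C."},
]
pairs = split_rows(records, "Authors")
for eid, author in pairs:
    print(eid, author)
```

Keeping the EID next to every split value is what allows the values to be joined back to the right document later.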

Copy Data
Copy all data from the Split Sheet to the Cleaning WorkSPACE Sheet. Please make sure you copy and paste it into the right columns. Copy only from the columns that have data.

Copy Formula
Copy all formulas from Column G to Column Q and apply them to all the data.

Identify and Complete the MISSING DATA

Columns:

Authors Full Name
Identify any missing author's name. If the author name is not available, write dummy data such as Author Name, Not Available (123456789). Check all the names MANUALLY and make sure all names are standardised.

Authors with Affiliation
Identify any missing author's affiliation. You might need to search by EID or Author's ID in Scopus, or Google the article, to find the details about the author's affiliation, including the country. Fill in the missing data accordingly.

Single Name Author
Sort the Single Name Author column and identify any author name marked as 1. If there is any, just add a dummy initial in the Authors Full Name column. For example, if the author name is just Ahmi, add the comma and initial after the name so that it becomes Ahmi, A.

Affiliation
Sort the Affiliation column and check for blank cells. Search by EID or Author's ID in Scopus, or Google the article title, to find the affiliation of the authors. Some affiliation names from the Authors with Affiliation column cannot be read in this column, so they need to be copied manually.

Country
Sort the Country column and check for blank cells. Make sure there is no empty cell.
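The single-name fix can be sketched as a small helper. The flowchart does not say which letter to use as the dummy initial, so this sketch assumes the first letter of the name itself, which matches the Ahmi example; the function name is hypothetical.

```python
def add_dummy_initial(name):
    """Append a dummy initial to a single-name author, e.g. 'Ahmi' -> 'Ahmi, A.'.

    Assumption: the dummy initial is the first letter of the name itself.
    Names that already contain a comma are returned unchanged.
    """
    name = name.strip()
    if not name or "," in name:
        return name  # empty, or already in 'Surname, Initial' form
    return f"{name}, {name[0].upper()}."


print(add_dummy_initial("Ahmi"))
print(add_dummy_initial("Ahmi, A."))
```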

MISSING DATA COMPLETED
Make sure all the missing data have been filled. At this stage, the dataset is still not yet cleaned and harmonised.

MANUAL DATA CLEANING



4. Clean Author's Name and Affiliation Data using OpenRefine

Copy all data from the Cleaning WorkSPACE and paste it into a new Excel file. Name the file TO CLEAN USING OPENREFINE.xlsx.

Clean and Harmonise using OpenRefine
Clean the Authors Full Name and Affiliation columns using Text Facet, all Cluster Methods and Functions, and manual cleaning.
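To give a feel for what a clustering method does, the sketch below groups variant spellings by a fingerprint-style key (lowercased, ASCII-folded, punctuation removed, tokens sorted and de-duplicated). It is written in the spirit of key-collision clustering and is not OpenRefine's own implementation; the institution names are invented.

```python
import string
import unicodedata
from collections import defaultdict


def fingerprint(value):
    """Build a key: lowercase, ASCII, no punctuation, sorted unique tokens."""
    text = value.strip().lower()
    # Fold accented characters to plain ASCII (for example 'é' becomes 'e').
    text = unicodedata.normalize("NFKD", text).encode("ascii", "ignore").decode("ascii")
    text = text.translate(str.maketrans("", "", string.punctuation))
    return " ".join(sorted(set(text.split())))


def cluster(values):
    """Group values sharing a fingerprint; keep only groups with real variants."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(set(group)) > 1]


# Invented names for illustration only.
names = [
    "Example State University",
    "example state university.",
    "University, Example State",
    "Another Institute",
]
clusters = cluster(names)
for group in clusters:
    print(group)
```

A key like this only catches differences in case, punctuation, accents and word order, which is why manual cleaning is still needed for abbreviations and genuinely different spellings.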

Export the file to Excel. Name the file AFTER CLEAN USING OPENREFINE.xlsx.

Copy the Authors Full Name and Affiliation columns back to the Cleaning WorkSPACE.



5. Further Data Exploration and Preparing Dataset for VOSviewer and Biblioshiny

CLEANED AUTHOR'S NAME, AFFILIATION & COUNTRY

Copy Cleaned Authors data into biblioMagika®
Data Analysis (cont.)
1. Pub. by Authors
2. Lotka's Law

Copy Cleaned Affiliation data into biblioMagika®
Data Analysis (cont.)
3. Pub. by Institutions

Copy Cleaned Country data into biblioMagika®
Data Analysis (cont.)
4. Pub. by Countries
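The author analyses rely on clean names because every spelling variant is counted as a separate author. As a minimal sketch, the code below counts publications per author and compares the result with the classic form of Lotka's law, in which the number of authors with n publications is roughly the number with one publication divided by n squared. The function names and data are illustrative, and biblioMagika® may calculate this differently.

```python
from collections import Counter


def author_productivity(author_lists):
    """Return {n_publications: number_of_authors} from per-document author lists."""
    pubs_per_author = Counter(
        author for authors in author_lists for author in set(authors)
    )
    return dict(sorted(Counter(pubs_per_author.values()).items()))


def lotka_expected(authors_with_one, n, exponent=2.0):
    """Expected number of authors with n publications under Lotka's law."""
    return authors_with_one / n ** exponent


# Made-up author lists, one list per document.
docs = [["A", "B"], ["A", "C"], ["A"], ["B", "D"]]
observed = author_productivity(docs)
print(observed)
print(lotka_expected(100, 2))
```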

Copy all data from the Cleaned Authors Sheet to the Join_NEW Sheet and join the split data. Join using the JoinNEW macros.

Copy all data from the Authors (Old) Sheet to the Join_OLD Sheet and join the split data. Join using the JoinOLD macros.
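Joining is the reverse of splitting: the cleaned values for each document are put back into one cell. The sketch below illustrates the idea only and is not the JoinNEW or JoinOLD macro code; it assumes each value is paired with its document's EID and that values are joined with a semicolon and a space.

```python
def join_values(pairs, separator="; "):
    """Group (EID, value) pairs by EID and join the values into one string."""
    grouped = {}
    for eid, value in pairs:
        grouped.setdefault(eid, []).append(value)
    return {eid: separator.join(values) for eid, values in grouped.items()}


# Placeholder pairs for illustration only.
cleaned_pairs = [
    ("2-s2.0-1", "Ahmi, A."),
    ("2-s2.0-1", "Author, B."),
    ("2-s2.0-2", "Author, C."),
]
joined = join_values(cleaned_pairs)
print(joined)
```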

1. Copy the Author Full Name column to Scopus.csv (Truncated).
2. Copy the Author with Affiliation column to Scopus.csv (Truncated).
3. Copy the Authors column to Scopus.csv (Truncated).
4. Copy the Affiliation column to Scopus.csv (Truncated).

Scopus.csv (WITH CLEANED AUTHORS & AFFILIATION).csv

Clean and Harmonise Keywords using OpenRefine
Clean the Authors Keywords and Indexed Keywords columns using Text Facet, all Cluster Methods and Functions, and manual cleaning.

Scopus.csv (WITH CLEANED AUTHORS & AFFILIATION & KEYWORDS).csv
This file is ready to be used in VOSviewer and Biblioshiny, but the analysis is limited to the fields related to the cleaned author information and keywords only. The References field still has issues and needs to be cleaned.

VOSviewer Biblioshiny



6. Combining Truncated File with References Data

6 5

scopus.csv (OLD - NOT TRUNCATED)
This file is in csv format. It CANNOT be opened using Excel.

Scopus.csv (WITH CLEANED AUTHORS & AFFILIATION & KEYWORDS).csv

Open in OpenRefine

Clean using OpenRefine
Cleaning and harmonising references (it's going to be a long process).

Combine in OpenRefine

CLEANED scopus.csv
Final dataset after it has been cleaned and harmonised.
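The flowchart performs the combination inside OpenRefine. Purely to illustrate the idea, the sketch below overwrites selected columns of the full (non-truncated) records with the cleaned values from another file. It assumes both files share an `EID` column to match on, which the flowchart does not state explicitly; the function name and records are made up.

```python
def combine_by_eid(full_rows, cleaned_rows, columns):
    """Copy the cleaned `columns` into the full rows, matching records on EID."""
    cleaned = {row["EID"]: row for row in cleaned_rows}
    combined = []
    for row in full_rows:
        merged = dict(row)  # copy so the input rows are left untouched
        match = cleaned.get(row["EID"])
        if match:
            for column in columns:
                merged[column] = match[column]
        combined.append(merged)
    return combined


# Made-up records: the full file keeps the references, the cleaned file the authors.
full = [
    {"EID": "2-s2.0-1", "Authors": "ahmi a.", "References": "Ref 1; Ref 2"},
    {"EID": "2-s2.0-2", "Authors": "author c", "References": "Ref 3"},
]
cleaned = [{"EID": "2-s2.0-1", "Authors": "Ahmi, A."}]
combined = combine_by_eid(full, cleaned, ["Authors"])
print(combined)
```

Records without a match keep their original values, so it is worth checking how many rows were actually updated before using the combined file.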

VOSviewer Biblioshiny
