Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

15/6/22, 17:07 WEBSCRAPPING_GozmeAvila.

ipynb - Colaboratory

HugoVega_Articulos.csv

pip install beautifulsoup4
1 to 10 of 20 entries Filter
Looking in indexes: https://pypi.org/simple, ht Arcticulos Años
Requirement already satisfied: beautifulsoup4 i
Programación en N capas 2014
Sistema de apoyo a la generación
de horarios basado en algoritmos 2010
pip install bs4 genéticos
Integration of the enterprise
Looking in indexes: https://pypi.org/simple, ht information to facilitate decision 2021
Requirement already satisfied: bs4 in /usr/loca making
Requirement already satisfied: beautifulsoup4 i Classification algorithm based on
machine learning to optimize 2020
athletes talent detection
Data mart design to improve the
import requests decision-making process of the 2020
from bs4 import BeautifulSoup after-sales service
import pandas as pd Contributions to the Technological
Adoption Model for the Peruvian 2021
Agro-Export Sector
url = "https://scholar.google.es/citations?user=CfFN
Wearable Technology to Improve
Health Care Infants in the Yomibato 2020
Peruvian Community
peticion_HugoVega= requests.get(url)
Aplicación de las redes de Petri a la
2009
simulación discreta de sistemas
textourl = peticion_HugoVega.text Estudio y evaluación de los
sistemas de Recuperación de 2004
información
gbsoup = BeautifulSoup(textourl,'html.parser') Redes neuronales para el
reconocimiento de la calidad
morfológica de mangos exportables 2017
gbsoup.find(class_=property).get_text() para la empresa Biofruit del Perú
SAC
'HUGO FROILAN Vega Huerta - Google Scholarhtm
l,body,form,table,div,h1,h2,h3,h4,h5,h6,img,o Show 10 per page 1 2
l,ul,li,button{margin:0;padding:0;border:0;}ta
ble{border-collapse:collapse;border-width:0;em
pty-cells:show;}html,body{height:100%}#gs_top
{position:relative;box-sizing:border-box;min-h
eight:100%;min-width:964px;-webkit-tap-highlig
ht-color:rgba(0,0,0,0);}#gs_top>*:not(#x){-web

articulos = [i.get_text(strip=True) for i in gbsoup.f

print(articulos)

['Programación en N capas', 'Sistema de apoyo a

citaciones = [i.get_text(strip=True) for i in gbsoup

citaciones.remove('')

https://colab.research.google.com/drive/1aYVV2Iq9dLJUsligVSxwvYx-XQdboHd0#scrollTo=aaEZnnsTPWtY&printMode=true 1/2
15/6/22, 17:07 WEBSCRAPPING_GozmeAvila.ipynb - Colaboratory

citaciones.remove('Cited byCited by')

print(citaciones)

['21', '12', '10', '10', '8', '7*', '5', '5',

años = [i.get_text(strip=True) for i in gbsoup.find_

print(años)

['2014', '2010', '2021', '2020', '2020', '2021

gbdf_gbarticulos = pd.DataFrame({
    'Arcticulos': articulos,

    'Años': años

})

gbdf_gbarticulos.to_csv('HugoVega_Articulos.csv', ind

check 0 s completado a las 17:05

https://colab.research.google.com/drive/1aYVV2Iq9dLJUsligVSxwvYx-XQdboHd0#scrollTo=aaEZnnsTPWtY&printMode=true 2/2

You might also like