R 爬取獵聘網招聘資料

數據獲取的渠道:企業內部,機臺,網站 etc..., 現使用R的rvest包進行網絡數據的爬取

code:

library(rvest)

library(stringr)

library(tidyr)

library(dplyr)

url

page

# get positon

position%html_nodes("ul.sojob-list div.sojob-item-main div.job-info h3")%>%html_text(trim=TRUE)

p%data.frame()

names(p)

# get area

area%html_nodes("ul.sojob-list div.sojob-item-main div.job-info a.area")%>%html_text(trim=TRUE)

a%data.frame()

names(a)

# 提取了salary education experience

experience%html_nodes("ul.sojob-list div.sojob-item-main div.job-info p.condition span")%>%html_text(trim=TRUE)

names(e)

dt

colnames(dt)

for(n in 1:3){

j

i

while(i

dt[j,n]

j

i

}

}

# 循環查詢

for(n in 1:10){

url

page

position%html_nodes("ul.sojob-list div.sojob-item-main div.job-info h3")%>%html_text(trim=TRUE)

p1%data.frame()

names(p1)

p

#area

area%html_nodes("ul.sojob-list div.sojob-item-main div.job-info a.area")%>%html_text(trim=TRUE)

a1%data.frame()

names(a1)

a

#experience

experience%html_nodes("ul.sojob-list div.sojob-item-main div.job-info p.condition span")%>%html_text(trim=TRUE)

dt1

colnames(dt1)

for(d in 1:3){

j

i

while(i<=length(e)){

dt[j,d]

j

i

}

}

dt

}

work_info

}

# 數據可視化:

略...


分享到:


相關文章: