Publications by Shin Lee
ATA W14
Learning Objectives Describe a theoretical foundation of topic modeling methods Understand word co-occurrences as a way of analyzing topics Analyzing Topics Texts can be analyzed in terms of topics. This means we can analyze text in terms of what it is that it is written about in the first place. For example, consider how the topics covered in ...
13780 sym R (8973 sym/40 pcs) 4 img 1 tbl
DM_BD_W13
의미 연결망 분석 <출처: 김영우, “Do it! 쉽게 배우는 R 텍스트 마이닝”, 2021, 이지스퍼블리싱> 단어의 의미는 문장에 함께 사용된 단어에 따라 달라집니다. 단어의 빈도를 분석하면 중요한 단어가 무엇인지는 알 수 있지만, 단어가 어떤 맥락에서 사용됐는지는 알 ...
6783 sym R (14373 sym/57 pcs) 7 img
ITM: Week15
R을 이용한 토픽모델링 여성에 대한 미디어 담론 분석 1990년부터 현재까지 여성 담론 주제 변화 추적 빅카인즈 중앙지 기사 중 헤드라인에 “여성” 혹은 “여자” 키워드 포함 기사 131534여건 분석 1. 코퍼스 준비 library(tidyverse) ## -- Attaching packages -------------------...
2338 sym R (25263 sym/112 pcs) 9 img
BDJ21-W14-2
Toubleshooting How Do I Run the Selenium Server? rsDriver This function rsDriver manages the binaries needed for running a Selenium Server. If you have an error message like “Selenium message:session not created: This version of ChromeDriver only supports Chrome version 88”, then you may want to 1) set the “chromever” argument to be “9...
6679 sym R (3723 sym/18 pcs)
BDJ21-W14-1
RSelenium Basics https://cran.r-project.org/web/packages/RSelenium/vignettes/basics.html Introduction The goal of RSelenium is to make it easy to connect to a Selenium Server from within R. RSelenium provides R bindings for the Selenium Webdriver API. Selenium is a project focused on automating web browsers. RSelenium allows you to carry out uni...
5422 sym R (3587 sym/25 pcs) 1 tbl
BDJ21-W13-2
Why Selenium? How to use RSelenium? #install.packages("RSelenium") library(RSelenium) How Do I Connect to a Running Server? RSelenium has a main reference class named rsDriver. To connect to a server, you need to instantiate a new rsDriver with appropriate options. Now we load Selenium drivers and start Selenium. It may take time, so please wa...
3634 sym R (2274 sym/16 pcs)
BDJ21-W11-2
Web scraping in practice library(XML) library(rvest) library(stringr) Manipulating URLs to access multiple pages baseurl <- "https://news.daum.net/ranking/popular?regDate=" dates <- seq(from=20201025, to=20201031, by=1) urls <- str_c(baseurl, dates) urls ## [1] "https://news.daum.net/ranking/popular?regDate=20201025" ## [2] "https://news.d...
2083 sym R (27568 sym/100 pcs)
BDJ21-W12-1
If you want to knit this R Markdown to PDF, you may want to run the following R code, first: tinytex::install_tinytex() Review on web scraping library(XML) library(stringr) baseurl <- "https://movie.naver.com/movie/point/af/list.nhn?&page=" pages <- seq(from=1, to=1000, by=1) # Maximum of 1000 pages urls <- str_c(baseurl, pages) class(urls...
3304 sym R (3323 sym/13 pcs) 1 tbl
BDJ-W10-2
Web scraping in practice library(XML) library(rvest) library(stringr) page <- readLines("https://news.daum.net/ranking/popular") page_parsed <- htmlParse(page) https://news.daum.net/ranking/popular?regDate=20211108 First Headline XPath: ‘//[@id="mArticle"]/div[2]/ul[3]/li[1]/div[2]/strong/a’ Second Headline XPath: ’//[@id="mArticle"]/div...
1594 sym R (125207 sym/46 pcs)
BDJ21-W9-2
Regular Expression Last time, we learned some basic functions from the stringr package for handling and working with text in R. But in this course, we want to unleash the power of strings manipulation. So we are going to learn about regular expressions. What are Regular Expressions? The name “Regular Expression” does not say much. However, r...
13143 sym R (4411 sym/32 pcs)