Quantitative Text Analysis

Slides

The slide decks used in class are available here and are updated as the course progresses.

 

Contribute

“The reproducibility of studies and the ability to follow up on the work of others is key for innovation in science and engineering.”
— Leland Wilkinson
communication
contribute
reporting
reproducible research
Apr 17, 2024

 

Infer

“People generally see what they look for, and hear what they listen for.”
— Harper Lee, To Kill a Mockingbird
analysis
inference
simulation
infer
Apr 10, 2024

 

Predict

“All models are wrong, but some are useful.”
— George E.P. Box
analysis
prediction
supervised-learning
machine-learning
text-analysis
tidymodels
Apr 3, 2024

 

Explore

“The data speaks for itself, but only if you are willing to listen.”
— Nate Silver
analysis
frequency analysis
co-occurrence analysis
clustering
dimensionality reduction
word embeddings
Mar 27, 2024

 

Transformation

Prepare and enrich datasets
data
information
tidy-data
data-transformation
regular-expressions
datasets
stringr
purrr
tidyr
readr
fs
Mar 20, 2024

 

Taming data

The process of curating data
data
curate
process
data-dictionary
readr
dplyr
tidyr
readtext
qtalrkit
Mar 6, 2024

 

Curate

Data to information
data
information
tidy-data
data-curation
regular-expressions
datasets
stringr
purrr
tidyr
readr
fs
Mar 1, 2024

 

Harvesting research data

Acquiring research-aligned data
data
control-statements
custom-functions
downloads
api
Feb 28, 2024

 

Acquire

Source, acquisition, and documentation of data
acquire
download
api
data
web
functions
r
control-statements
csv
compressed-files
readr
fs
dplyr
tidyr
gutenbergr
Feb 23, 2024

 

Project orientation

A guide to setting up the project, with an overview of the project prospectus
research
reproducible-research
quarto
git
github
project
prospectus
Feb 21, 2024

 

Scaffolding reproducible research

A template for reproducible research
research
reproducible-research
git
github
fork
clone
push
Feb 16, 2024

 

Research

Framing and scaffolding the research process
research
research-question
research-aim
research-statement
reproducible-research
Feb 14, 2024

 

Trace the datascape

Descriptive assessment of datasets
ggplot2
skimr
janitor
knitr
summarize()
group_by()
count()
tabyl()
kable()
geom_*()
Feb 9, 2024

 

Analysis

Approaching statistical thinking for text analysis.
central-tendency
dispersion
distribution
association
analysis
documentation
r
skimr
dplyr
ggplot2
Feb 7, 2024

 

Reading, inspecting, and writing datasets

A first approach to combining Quarto and R
quarto
r
readr
dplyr
Feb 2, 2024

 

Data

Understanding data and information
sampling
corpora
tidy-data
data-structures
data-types
Jan 31, 2024

 

Academic writing with Quarto

Enabling more productive scholarly communication
quarto
front-matter
figures
tables
citations
tables-of-contents
cross-references
code
code-blocks
Jan 26, 2024

 

Text analysis in context

Where science, data, and linguistics meet.
data science
research
computer science
statistics
linguistics
Jan 24, 2024

 

Writing with code

An introduction to Literate Programming with R and Quarto
quarto
r
recipes
github
rstudio
Jan 19, 2024

 

Introductions

Course goals, the approach, and the resources
syllabus
resources
guides
Jan 17, 2024