Julia Silge, David Robinson

Text Mining with R

A Tidy Approach

Paperback, English, 2017, 1st edition, ISBN 9781491981658

€ 46,53

Delivery time: approximately 16 working days

Free shipping

Summary

Much of the data available today is unstructured and text-heavy, making it challenging for analysts to apply their usual data wrangling and visualization tools. With this practical book, you’ll explore text-mining techniques with tidytext, a package that authors Julia Silge and David Robinson developed using the tidy principles behind R packages like ggraph and dplyr. You’ll learn how tidytext and other tidy tools in R can make text analysis easier and more effective.

The authors demonstrate how treating text as data frames enables you to manipulate, summarize, and visualize characteristics of text. You’ll also learn how to integrate natural language processing (NLP) into effective workflows. Practical code examples and data explorations will help you generate real insights from literature, news, and social media.

- Learn how to apply the tidy text format to NLP
- Use sentiment analysis to mine the emotional content of text
- Identify a document’s most important terms with frequency measurements
- Explore relationships and connections between words with the ggraph and widyr packages
- Convert back and forth between R’s tidy and non-tidy text formats
- Use topic modeling to classify document collections into natural groups
- Examine case studies that compare Twitter archives, dig into NASA metadata, and analyze thousands of Usenet messages

Specifications

ISBN-13: 9781491981658

Keywords: Programming, Programming languages, R, Text Mining

Language: English

Binding: Paperback

Number of pages: 194

Publisher: O'Reilly

Edition: 1

Publication date: 27 June 2017

Main category: IT management / ICT

About Julia Silge

Julia Silge is a data scientist at Stack Overflow. She enjoys making beautiful charts, the statistical programming language R, black coffee, red wine, and the mountains of her adopted home here in Utah. She has a PhD in astrophysics and an abiding love for Jane Austen. Her work involves analyzing and modeling complex data sets while communicating about technical topics with diverse audiences.

About David Robinson

David Robinson is a data scientist at Stack Overflow. He has a PhD in Quantitative and Computational Biology from Princeton University, where he worked with Professor John Storey on genomic analysis. He enjoys working on and blogging about statistics, R programming, and text mining, including a popular analysis of Donald Trump’s Twitter account (performed according to the tidy data principles described in this book).

Table of contents

1. The tidy text format
2. Sentiment analysis with tidy data
3. Analyzing word and document frequency: tf-idf
4. Relationships between words: n-grams and correlations
5. Converting to and from non-tidy formats
6. Topic modeling
7. Case study: comparing Twitter archives
8. Case study: mining NASA metadata
9. Case study: analyzing Usenet text
