Exploring the First 25 with Voyant

What is Voyant-Tools?

Voyant-Tools is an open-source text analysis environment, available as a browser tool at voyant-tools.org or as a desktop tool that runs on a local server. Loading a corpus and getting started is simple, and the tool handles a variety of file formats (plain text, PDF, HTML, XML, RDF, Word), which makes it a good platform for teaching and for gaining quick insights into your data.

The default “skin” contains five visualization tools that interact with one another as you select different words and options, and many more tools are available beyond the default. Visualizations are shareable and embeddable, and you can also export your current view as a URL.

Word Frequencies

Voyant provides several options for exploring word frequencies in your corpus. The word cloud (“Cirrus”) and the line graph (“Trends”) are two simple ways to visualize the most frequently occurring words. The line graph has the additional benefit of breaking word frequencies out by document, which shows differences among the texts in your corpus.
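To make the idea behind these two views concrete, here is a minimal sketch in Python of counting word frequencies across a corpus and then breaking the top terms out by document. It is not how Voyant is implemented; the toy corpus, the document names, and the short stopword list are all assumptions made up for illustration.

```python
import re
from collections import Counter

# Hypothetical toy corpus standing in for the cookbook texts.
corpus = {
    "cookbook_a": "Beat the cream and sugar. Add butter and cream.",
    "cookbook_b": "Mix flour, sugar and butter. Bake until brown.",
}

# A tiny illustrative stopword list (an assumption, not Voyant's list).
STOPWORDS = {"the", "and", "until", "add"}

def tokenize(text):
    """Lowercase the text and return alphabetic tokens minus stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]

# Per-document counts: the kind of breakdown a line graph plots by document.
per_doc = {name: Counter(tokenize(text)) for name, text in corpus.items()}

# Corpus-wide counts: the kind of totals a word cloud sizes its words by.
total = Counter()
for counts in per_doc.values():
    total.update(counts)

# Show the three most frequent terms and their count in each document.
top_terms = [term for term, _ in total.most_common(3)]
for term in top_terms:
    print(term, {name: counts[term] for name, counts in per_doc.items()})
```

A term that is frequent overall may still be absent from some documents, which is exactly the difference the per-document view makes visible.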

Most frequent words in the first 25 cookbooks
Top 5 word frequencies in the first 25 cookbooks

Collocates

Frequency of 'cream' in first 25 cookbooks
Most frequent co-occurring words with 'cream'
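As a rough illustration of what a collocates list measures, the sketch below counts the words that appear within a fixed window of tokens around each occurrence of a keyword. The function name `collocates`, the window size, and the sample sentence are assumptions for illustration only, and Voyant's own calculation may differ in its details.

```python
import re
from collections import Counter

def collocates(text, keyword, window=2):
    """Count words appearing within `window` tokens of each `keyword` hit."""
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != keyword:
            continue
        # Clamp the context window to the bounds of the token list.
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        # Count every token in the window except the keyword hit itself.
        counts.update(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return counts

# Hypothetical sample text, not taken from the cookbooks.
sample = "Whip the cream with sugar. Fold whipped cream into the sugar mixture."
result = collocates(sample, "cream", window=2)
print(result.most_common(5))
```

A wider window captures looser associations at the cost of more noise, so the window size is worth adjusting when the results look either too sparse or too generic.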

Entity Recognition

Entity recognition for the First Congregational Church of Forest City cookbook

Document Clustering

Document similarity scatter plot, TF-IDF frequencies
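For readers unfamiliar with TF-IDF, here is a small sketch of how documents can be compared with it: each term is weighted by how often it occurs in a document and how rare it is across the corpus, and documents are then compared by cosine similarity. This is a generic textbook formulation under assumed toy data, not a description of how Voyant computes or projects its scatter plot.

```python
import math
import re
from collections import Counter

# Hypothetical toy documents standing in for cookbooks.
docs = {
    "a": "cream sugar butter cream",
    "b": "cream sugar flour",
    "c": "beef onion pepper",
}

def tfidf_vectors(docs):
    """Return a sparse {term: weight} TF-IDF vector for each document."""
    tokenized = {name: re.findall(r"[a-z]+", text.lower()) for name, text in docs.items()}
    n_docs = len(tokenized)
    # Document frequency: the number of documents containing each term.
    doc_freq = Counter()
    for tokens in tokenized.values():
        doc_freq.update(set(tokens))
    vectors = {}
    for name, tokens in tokenized.items():
        tf = Counter(tokens)
        # Relative term frequency times log inverse document frequency.
        vectors[name] = {
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in tf.items()
        }
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(u[t] * v.get(t, 0.0) for t in u)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

vectors = tfidf_vectors(docs)
sim_ab = cosine(vectors["a"], vectors["b"])  # documents sharing vocabulary
sim_ac = cosine(vectors["a"], vectors["c"])  # documents sharing no terms
print(sim_ab, sim_ac)
```

Documents that share distinctive vocabulary score closer to 1 and would sit nearer to each other in a similarity plot, while documents with no terms in common score 0.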