    • Member Info
      • Member Type:
      • Student or Learner
      • Native Language:
      • American English
      • Home Country:
      • United States
      • Current Location:
      • United States

    • Join Date: Mar 2016
    • Posts: 4
    • Post Thanks / Like
    #1

    Analyzing Large Volume of Text

    Hello,

    I used up the 20 analysis entries included with the "Advanced Text" feature on the site. The problem is that I have about 8,000 essays of roughly 500 words each to analyze. Does anyone know of a tool that would be effective for this purpose?

    Ideally, the tool would have analysis aggregated by date, author, etc.

    Thanks!

    #2

    Re: Analyzing Large Volume of Text

    Originally posted by jbuccola:
    Does anyone know of a tool that would be effective for this purpose?

    ... or a web service / API that I can invoke to get similar analysis, so that I can handle the aggregation on my end.

    #3

    Re: Analyzing Large Volume of Text

    After much digging, I found Readability-Score.com, which has a very accessible interface for uploading URLs in bulk and even offers PHP source code for anyone who wants a deeper integration.

    I was able to upload thousands of URLs in CSV/Excel format, and the service quickly scored them and emailed me a results file once processing finished. It averaged 4 URLs per second and appended the following fields to the list I provided:


    • Flesch-Kincaid Reading Ease
    • Flesch-Kincaid Grade Level
    • Gunning-Fog Score
    • Coleman-Liau Index
    • SMOG Index
    • Automated Readability Index
    • Average Grade Level
    • Character Count
    • Syllable Count
    • Word Count
    • Sentence Count
    • Characters per Word
    • Syllables per Word
    • Words per Sentence
    • Letters per Word
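
    For anyone who wants to sanity-check a few of these fields locally, here is a minimal Python sketch of the commonly published formulas for three of the indices, computed from the raw counts in the list above. This is not Readability-Score.com's code: `TextCounts`, `readability`, and the example counts are hypothetical names and values chosen for illustration, and the service may count syllables or sentences differently, so its numbers need not match exactly.

```python
from dataclasses import dataclass


@dataclass
class TextCounts:
    """Raw counts for one essay (hypothetical container, not the service's schema)."""
    characters: int  # letters and digits, excluding whitespace
    syllables: int
    words: int
    sentences: int


def readability(c: TextCounts) -> dict:
    """Compute three indices from raw counts using the standard published formulas."""
    words_per_sentence = c.words / c.sentences
    syllables_per_word = c.syllables / c.words
    characters_per_word = c.characters / c.words
    return {
        # Flesch Reading Ease: higher means easier to read.
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * syllables_per_word,
        # Flesch-Kincaid Grade Level: approximate U.S. school grade.
        "flesch_kincaid_grade": 0.39 * words_per_sentence
        + 11.8 * syllables_per_word
        - 15.59,
        # Automated Readability Index: uses characters instead of syllables.
        "automated_readability_index": 4.71 * characters_per_word
        + 0.5 * words_per_sentence
        - 21.43,
    }


# Made-up counts for a single ~500 word essay.
example = TextCounts(characters=2300, syllables=750, words=500, sentences=25)
scores = readability(example)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

    The other indices in the list (Gunning-Fog, SMOG, Coleman-Liau) need additional counts such as complex or polysyllabic words, so they are left out of this sketch.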




    I added a few key fields of my own (date and author, for example) so that I could aggregate and sort the results, and I am analyzing the data with Microsoft's excellent (and free) Power BI tool.
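
    The same kind of aggregation can also be done without a BI tool. The sketch below averages one score column by year and by author using only the Python standard library. The column names (`Author`, `Year`, `Average Grade Level`), the `average_by` helper, and the three sample rows are assumptions made up for illustration; they are not the actual layout or contents of the file the service returns.

```python
import csv
import io
from collections import defaultdict
from statistics import mean


def average_by(rows, key, value="Average Grade Level"):
    """Group rows by the `key` column and average the `value` column per group."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[key]].append(float(row[value]))
    return {group: mean(scores) for group, scores in sorted(groups.items())}


# Made-up sample standing in for the scored CSV plus the added fields.
SAMPLE = """URL,Author,Year,Average Grade Level
http://example.com/a,alice,2010,6.0
http://example.com/b,bob,2010,7.0
http://example.com/c,alice,2015,12.0
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))
by_year = average_by(rows, "Year")
by_author = average_by(rows, "Author")
print(by_year)
print(by_author)
```

    For a real file, `io.StringIO(SAMPLE)` would be replaced by an open file handle, and `average_by` can be pointed at any other numeric column through its `value` argument.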

    Here's an example visualization of various readability indices charted by year (the bars show average grade level, the X axis is year, and the Y axis is grade level). You can see that grade level rose from 6th grade to 12th+ over time for the same cohort of authors:



    Thanks to this site for getting me started!
