  1. Newbie
    Student or Learner
    • Member Info
      • Native Language:
      • English
      • Home Country:
      • New Zealand
      • Current Location:
      • South Korea

    • Join Date: Oct 2011
    • Posts: 1

    Corpus L. Question: What counts as a word in BNC spoken?

    Hi there,
    I'm working with a list of words taken from transcripts, and want to compare them to the frequencies found in the BNC spoken, the list found at

    However, I'm having a hard time finding the exact rules that were followed for defining 'word' in this corpus. For example, how did the BNC count multiword lexical items?

    From scrutinizing the list, you can find multiwords like 'brand new', 'even when', 'by now' with their own frequency count, and yet you find 'new' listed as "NoP~" with the diacritic mark indicating that it's part of a noun like 'New York'... (I think). It seems inconsistent to me.

    So if anybody can find BNC's guidelines for their spoken corpus online I'd be very grateful.
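
    To make the counting problem concrete, here is a minimal sketch of one way transcript tokens could be counted so that listed multiwords are treated as single units. This is only an illustration of the issue, not the BNC's documented procedure: the `MULTIWORDS` set, the `count_units` helper, and the greedy longest-match strategy are all assumptions, and tokens are assumed to be lowercased already.

```python
from collections import Counter

# Hypothetical multiword units for illustration only; a real set would have
# to be taken from the BNC frequency list itself.
MULTIWORDS = {("brand", "new"), ("even", "when"), ("by", "now")}
MAX_LEN = max(len(mw) for mw in MULTIWORDS)


def count_units(tokens):
    """Count tokens, merging any listed multiword into one unit.

    Uses greedy longest-match from left to right (an assumed strategy).
    """
    counts = Counter()
    i = 0
    while i < len(tokens):
        # Try the longest possible multiword starting at position i first.
        for n in range(min(MAX_LEN, len(tokens) - i), 1, -1):
            candidate = tuple(tokens[i:i + n])
            if candidate in MULTIWORDS:
                counts[" ".join(candidate)] += 1
                i += n
                break
        else:
            # No multiword starts here, so count the single token.
            counts[tokens[i]] += 1
            i += 1
    return counts


sample = "by now the car was brand new".split()
counts = count_units(sample)
```

    With this approach, 'new' inside 'brand new' is not counted as a separate word, so the single-word frequency of 'new' in my own list depends entirely on which multiwords I decide to merge. That is why I need the corpus's actual rules before comparing numbers.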

  2. 5jj
    VIP Member
    Retired English Teacher
    • Member Info
      • Native Language:
      • British English
      • Home Country:
      • England
      • Current Location:
      • Czech Republic

    • Join Date: Oct 2010
    • Posts: 27,915

    Re: Corpus L. Question: What counts as a word in BNC spoken?

    I am not sure if this helps, but some of the links in it may.

    Multiwords and associated tags in BNC2
    Last edited by 5jj; 18-Oct-2011 at 15:29. Reason: typo
