A concordancer is a computer program which is used to search through a corpus, in other words a collection of texts. This video reviews the four main online concordancers for academic English study: Lextutur; BNC; MICUSP; and SKELL.
▬ Contents of this video ▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
00:00 Intro
00:25 What is a concordancer?
02:12 Lextutor
07:43 BNC
14:07 MICUSP
16:53 SKELL
19:35 Summary (language)
21:02 Summary (features)
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Video created using VideoScribe software [affiliate link]:
https://tidd.ly/37VdIB7
Concordancers function like search engines, providing a list of sentences containing the search term or other information such as frequency. This allows the user to look for patterns, to see how common a word or phrase is, or to understand how the word is used, for example whether it commonly combines with particular words or phrases (i.e. collocations)
Concordancers search through a corpus (plural form corpora), which can consist of spoken or written texts, and can be collected according to many different principles. For example, there are corpora for newspaper articles, fiction, web pages, as well as for academic English (spoken and written). Some corpora have thousands of words, while others have millions or even billions of words.
Show More Show Less View Video Transcript
0:00
This video is brought to you by eapfoundation.com, the website for all your academic English needs
0:06
So this video looks at concordances for academic English study. There are two parts. First we'll see what a concordance is
0:14
And then we'll look at four online concordances. Namely, Legstuter, the BNC concordance, myCUSP and SCEL
0:25
First then, what is a concordance? A concordance is a computer program which is used to search through a corpus
0:33
A corpus is a collection of texts. The plural of the word is corpora
0:38
These texts can be grouped according to many different classifications. So we could have a corpus for newspaper articles, a corpus for fiction, a corpus for non-fiction texts
0:49
a corpus of web pages, as well as corpora for academic English
0:54
which could be spoken academic English, presentations, lectures, discussions, or written academic English, essays, reports and so on
1:02
And corpora vary in size. Some contain just a few thousand words
1:07
Some may contain millions of words. Very large corpora may even contain billions of words
1:13
A concordance functions very much like a search engine. But instead of a list of websites, it provides a list of sentences containing the search word
1:22
And this is useful in order for you to look for patterns, to see how common a word or phrase is
1:28
and to understand how a word or phrase is used. This video is going to be very practical
1:34
So I'm going to show you how to improve writing using concordances
1:38
To do this, we're going to start with this short example of a student piece of writing with some errors in it
1:44
Different from green energy, energy from coal will make many bad effects to the environment
1:50
Figure 1 is showing some of these effects. So the errors here are different from make, bad, to, and is showing
2:02
And we'll use each of those concordances to see how we can correct these errors
2:06
So let's begin by looking at the Legstuter concordance. So this is the Legstuter homepage
2:17
It has many different vocabulary tools, including a highlighter for the academic word list, which I reviewed in another video
2:23
So what we're interested in is the concordance. And you can see it has concordances for other languages besides English
2:29
So this is the interface for the Legstuter concordance. One of the great things about the Legstuter concordance is that there are many different corpora that you can use
2:38
So it defaults to the BAWE, the British Academic Written English corpus, which has 8 million words
2:44
There's also the BASE, the British Academic Spoken English corpus. There's a corpus of Academic Abstracts
2:50
There's the BNC, COCA, the corpus of Contemporary American English. There's a corpus for Jane Austen's works, Presidential Speeches, Shakespeare's works, and Wikipedia
3:00
The BAWE, which it defaults to, is a very nice corpus and the one I normally use, so we'll just leave it on that one
3:06
So let's start off by searching for the word effect and pressing Enter
3:10
There are other settings that we can change, but my experience with students is it's easier to change the settings after the first search
3:17
So that's what we'll do here. So we need to wait while it's assembling the data
3:21
OK, so here are the results. These are the concordance lines with the word effect
3:29
There are 2,788 in total. At the moment, these results are not sorted in any way
3:36
One thing I'm interested in is adjectives which can be used with effect. So let me search again, and this time sort by words to the left
3:45
So here are the results. We can see adverse effect is quite common
3:51
An effect. And effect. If we scroll down to the end of the page, we can actually see the relative frequencies of words which occur to the left of effect
4:04
This is one negative of the site, because this information would be much more useful at the top of the page
4:09
The most common adjective is negative, which occurs 63 times. And that has the same meaning as the word bad
4:15
There's also the word detrimental 26 times, and adverse 15 times. So those three adjectives might be a better alternative to the word bad
4:24
Something else we can do is we can look for pairs of words. In this box where it says plus a sock, which means associated with, I'm going to put the word negative
4:32
And next to that it says on left, and that's what we want. Words which are associated on the left with the word effect
4:41
So here you can see all the examples of negative with the word effect. And this should actually give us more information
4:46
So we can see here had a strong negative effect. Have the negative effect
4:50
Has a profound negative effect. Have some negative effect on, and so on
4:55
So we can now see that the correct verb to use with effect is have
4:59
Have an effect. How about the preposition which goes to the right
5:04
Well this time I'll keep the same search, but I will sort by words which occur to the right
5:09
We can see that of occurs a few times, but that's not the right sentence structure
5:14
The negative effect of. If we look for examples with a verb, we see the verb have a negative effect on
5:25
So in addition to have being the correct verb, we can see that the correct preposition is on
5:31
You can see the example extract is rather short. We can actually click on the keyword to see it in its full context
5:40
So how about the other problems that we had in the student writing? Well let's search this time for different from
5:48
And this time it needs to be associated on the right. Let's wait for the results
5:54
Okay, so here are the results. So scroll down to different from
5:59
The keyword, in this case different, is all in capitals, which makes it a little bit difficult to see the punctuation
6:05
But if we look at the example sentences closely, we can see that different from always occurs in the middle of a sentence
6:11
It never occurs at the beginning of a sentence. So that's the problem with that student's use of that transition
6:17
It doesn't occur at the beginning of a sentence. Let's also look for figure
6:23
Let's try searching for figure one. And this time it's a bit difficult to see the results
6:30
So let me change that search. Instead of searching for figure one, let me search for figure show
6:37
Because one of the good things about the LexTutor Concordance is it will match word forms, not just the exact word
6:44
So here we can see examples of figure with show. Figure one, nineteen, shows
6:49
Figure one, one, shows. Figure ten, shows. So we can see very quickly that we need to use the present simple tense here, shows
6:58
Not present continuous, it's showing. One other thing which I'll show you here very quickly is that we can also sort words according to the sub-corpus
7:08
In this case the frequency information is presented at the top of the page with better formatting, so this is much more useful
7:14
So these results are again for figure associated on the right with the word show
7:18
We can see this occurs most commonly in the engineering sub-corpus, 77 times
7:23
But also quite frequently in physics, 70 times, biology, 50 times, meteorology
7:28
So we can deduce from that that the word figure associated with the verb show is more common in science subjects than other disciplines
7:35
Next let's look at the BNC Concordance. BNC is short for British National Corpus and this was created by Oxford University Press
7:51
It's a corpus that contains 100 million words. This corpus covers a wide range of genres including spoken text, fiction, magazines, newspapers and academic texts
8:06
In order to use the BNC Concordance you first need to log in, which is a very simple process
8:12
To use the BNC Concordance you first need to log in, which means you first need to register for an account
8:17
Registration is free and fairly straightforward. So this is the registration page where you need to fill in some details such as your name, email address and so on
8:25
The free account has some limitations, the main one being you can only make 50 queries a day compared to 200 queries for a paid account
8:32
But for ordinary use this is going to be enough. So this is the homepage of the BNC Concordance
8:38
You can see the search area on the left and some information about it on the right. There are different things we can do with this concordance
8:43
You see a list, chart, collocates, we can compare words and KWIC which is Keyword in Context Display
8:50
So let's start off with list. Let's begin by searching again for the word effect
8:55
And you can see that this occurs 22,887 times. But this is in all the corpus and not just the academic section
9:03
If you want to conduct searches in the academic section then we need to choose that
9:07
So we click on section here and select academic. And then if I search again we see that there are now fewer results, 8099
9:17
If I click on the word effect here we can see some examples of the word in context
9:23
Next I'm going to look at the collocates tab. So this is useful for finding collocations of the keyword
9:30
And we can search up to four words to the left or four words to the right
9:36
For this search I'm just going to choose one word to the left
9:42
Again it's still on the academic section. So let me search. And these are the results
9:48
Here it shows frequency in the academic section as well as frequency overall
9:53
You can see this is very visual so it shows very quickly the relative frequency of these different collocates
9:58
Some of these are just simple words. It's no, give, into. But we can also see some adjective and noun collocations
10:05
Direct effect, significant effect, adverse effect. The adjective negative is actually quite low down, number 35, with a frequency of 18
10:16
That compares to adverse which is 8th in the list, a frequency of 50
10:21
So the collocation adverse effect is more frequent in the BNC than it was in the BAWE using LexTutor
10:30
And if I click on the word adverse I can see some examples of that collocation in some sentences
10:36
So now I'm going to do a keyword in context search. Let's sort the results by the word one to the left because I'm interested in adjective collocations with effect
10:48
So let me search and wait for the result. And this shows some rather interesting information
10:54
So it's colour coded with blue being the nouns, pink for verbs, yellow for prepositions
11:00
So I can see fairly quickly that the word have goes with the word effect
11:04
Can have an adverse effect on. And it's often followed by the word on which is shown in yellow
11:10
Having the desired effect on. Have the desired effect. Have a dramatic effect on
11:19
And so on. The chart tab is really useful for showing comparisons across different sections of the corpus
11:33
So let me show you an example for the word bad. OK, we need to wait a while
11:38
This is one of the features I don't like about this site. Which is unless you have a premium account it will sometimes pause and ask you to upgrade your account for a fee
11:48
So we need to wait a little while here. OK, let's continue
11:55
So here you can see that the word bad has the lowest frequency in the academic corpus
12:01
It's used much more frequently in non-academic text, newspapers, magazines and especially spoken English
12:08
Now let's see what adverse looks like in the different sections. So you can see that adverse is used most frequently in academic text
12:18
Much less frequently in non-academic text. And very rarely in fiction or spoken English
12:26
There are some other options that you can change here but I won't go into those because they are a bit technical and some of them need a premium account in order for you to change them
12:38
What you will do though is to look at some of those other examples. So let's look for the word different
12:43
Again a keyword in context search. This time I'll sort by words to the right
12:47
Because I'm interested in the phrase different from. So here you can see that unlike lextutor the formatting of the original is retained
12:56
So we have a small d if the word is in the middle of a sentence and a capital d if it's at the beginning of a sentence
13:04
So if I scroll down to different from. These examples here. You can see it's always a small d. It's always in the middle of a sentence
13:12
Is very different from. Are very different from. Let's look at the other example I had
13:19
So let's search for figure. Again one word to the right. Okay it's asking me to upgrade the account again
13:25
Let's try that again. Okay so here we are
13:35
So here we're interested in the verbs which follow. Which remember are in pink
13:39
So figure one shows. Figure one illustrates. Depicts. Shows. Illustrates. Presents. So again we can very easily see here that the present simple tense is used with the word figure
13:55
We can also see some interesting alternatives to the word shows. For example illustrates
14:03
Next let's look at the MICASP concordance. This concordance is used as a corpus of 2.6 million words from 829 student papers submitted to Michigan University
14:23
So it goes without saying that this is an academic corpus. As with the other concordances I'm going to begin by searching for the word effect
14:32
So this concordance gives us some very interesting information. The first thing it shows here is how often it occurs in different disciplines
14:38
This is the raw data. It's much more useful to show frequency per 10,000 words
14:43
We can see very clearly that the word effect occurs most commonly in the economics discipline
14:48
11.89 times per 10,000 words. It also occurs very commonly in physics
14:53
10.49 per 10,000 words. That's more than double the frequency of the next discipline
14:59
Which is 4.66. Something else which is unique to this corpus is showing the distribution across paper types
15:06
So we can see the word effect occurs most commonly in research papers, 39%, and reports, 37%
15:13
Something else which is different about this concordance is that instead of concordance lines it shows a very extended example
15:19
This definitely makes it more difficult to look at patterns and to find collocations
15:24
Nevertheless we can see that the word effect goes with the verb have and the preposition on
15:29
Have had a great effect on. If we search for the word figure we don't get any good results, at least not on the first page
15:37
Something we can do with this concordance though is to search for more than one word
15:41
So let's search for the phrase different from. And here we have lots of examples which all show that this is used in the middle of a sentence with the verb to be
15:50
This is quite different from. Can be very different from. Are different from
15:57
To be significantly different from. And so on. This corpus remember is made up of upper level student papers, in other words very good student writing
16:07
And we can actually filter our search in many different ways. So we can change the student levels from undergraduate to first, second and third year graduate
16:16
We can look for native or non-native speaker use. We can look for language in the abstract, definitions, literature review and so on
16:25
We can limit to just certain paper types like argumentative essay or report or research paper
16:31
And we can also limit our search to different disciplines, biology, economics and so on
16:38
So although this concordance has some drawbacks, it definitely has some features which the other ones lack
16:48
Finally let's look at the scale concordance. So scale is short for Sketch Engine for Language Learning
17:00
This is a free simplified interface of the more advanced and subscription only corpus tool Sketch Engine
17:06
The scale concordance uses its own corpus which comes from a range of genres
17:11
News, books, blogs, Wikipedia which accounts for 39%, other web pages which account for 31% and also the British National Corpus at 9%
17:23
And although this includes the academic corpus of the BNC, this is not an academic corpus
17:29
But it's still very useful for academic study as we'll see. So scale is definitely the easiest of the concordances to use
17:37
Again let's search for the word effect. So it immediately brings up examples of the word in context
17:43
These examples are very different from the ones that we've already seen because they're all very short
17:47
And there is always exactly 40 of them. So on the one hand this makes the examples very simple
17:53
On the other hand it makes them a bit limited. If we click on the Similar Words tab we have words which are similar to effect
17:59
Maybe the most useful part of scale is the word sketch area. So here we can see different words which associate with the keyword effect
18:06
So we have verbs with effect as the subject. So effect occurs
18:10
And if we hover over that word we can see more examples. Over here we have verbs with effect as object
18:16
And the one at the top the most frequent is have. So this shows us that have is the most common verb that goes with effect
18:23
Although interestingly it doesn't have the structure have an effect on. And then down here we have modifiers of effect
18:30
In other words adjectives which could be used with effect. So you can see that adverse is very high up in the list
18:36
Adverse effects. Negative is actually third in the list. Negative effects. Here you can see some other words which associate with effect
18:43
The main one being cause. Cause and effect. The difficulty lies in separating cause and effect
18:53
Let's try searching this time for figure. So this time we're interested in verbs with figure as a subject
19:00
So you can see show at the top. Figures show. Illustrate is also high in the list
19:06
The figure below illustrates operations of dynamic logic. So present simple again is the tense that's used
19:12
And at the end a verb we saw before. Depict. If I try searching for different I don't actually get any information that's useful
19:19
Even if I look at the example sentences I can't see the phrase different from
19:24
So scale is not so useful for finding that particular problem. And remember it's not an academic corpus
19:35
So let me now summarise some of the things that we've learned. I'll begin with a language summary
19:41
So we've seen that the phrase different from can't be used at the beginning of a sentence
19:45
It combines with the verb to be. To be different from. We've seen that the noun effect goes with the verb to have
19:52
To have an effect. And the preposition on. To have an effect on
19:57
We also saw that bad is not an academic word. Negative is a more academic adjective
20:03
To have a negative effect on. But we also saw that adverse and detrimental can be used instead
20:09
To have an adverse effect on. To have a detrimental effect on
20:13
And we also saw that verbs following figure one should use the present simple
20:18
Figure one shows. Although again we saw some possible alternatives. Illustrates. Figure one illustrates
20:26
Or depicts. Figure one depicts. So with that information we should now be able to correct the errors in the student writing
20:33
The first one is a bit difficult because although we saw that different from can't begin a sentence we didn't see any alternatives
20:39
The word unlike can be used here. Unlike green energy. But we did see how to fix the other problems
20:46
So instead of make we should use the verb have. Have many adverse effects
20:51
Or we could also say negative effects. On the environment. Figure one shows some of these effects
20:59
Or we could also say illustrates. Finally let's summarise the different features of these concordances
21:05
Lake Stuta. The BNC. MyCusp. And Skel. The good news is that all of these are free to use
21:13
The BNC requires registration first before you can use it but the other ones don't
21:18
Lake Stuta is maybe the most powerful because it has more than one corpus
21:22
Whereas the other ones use their own corpus. All of them use an academic corpus except for Skel
21:30
If you want to see how words are used in different disciplines. For example biology, engineering, economics, psychology
21:37
Only Lake Stuta and MyCusp will do that for us. If we're interested in looking at how words and phrases are used in different writing types
21:46
Like reports, argumentative essays and so on. Only MyCusp will do that
21:52
And all of them are useful for collocation study with the possible exception of MyCusp
21:56
Because the extracts are rather too long to do that simply. In short all of the concordances have features which make them useful for academic English study
22:06
For more information on concordances and other aspects of academic English. Please visit the website eapfoundation.com
22:12
As well as more detailed information about concordances. You can find a worksheet for use with this video and others on the channel
22:19
Visit eapfoundation.com Finally if you're interested in exploring corpora in more detail to improve academic writing
22:28
There's a great book on this topic. It's called Academic Writing with Corpora
22:34
A resource book for data-driven learning. This was written by Tatiana Karpenko-Sikkim and is published by Routledge
22:41
You can find a link in the description below. And there's also a review of the book on the eapfoundation.com website
22:48
And there's also a review of the book on the eapfoundation.com website
#Software
#General Reference
#Dictionaries & Encyclopedias
#Language Resources
#Technical Reference


