{"id":74,"date":"2013-02-01T03:54:10","date_gmt":"2013-02-01T11:54:10","guid":{"rendered":"http:\/\/blogs.scu.edu\/anouaux\/?p=74"},"modified":"2013-02-02T20:20:37","modified_gmt":"2013-02-03T04:20:37","slug":"corpus-linguistics","status":"publish","type":"post","link":"https:\/\/blogs.scu.edu\/anouaux\/2013\/02\/01\/corpus-linguistics\/","title":{"rendered":"Corpus Linguistics"},"content":{"rendered":"<p>Svenja Adolphs, a Lecturer in Applied Linguistics at the University of Nottingham, provides a guide to the area of corpus linguistics in her book\u00a0<em>Introducing Electronic Text Analysis<\/em>. \u00a0 Corpus linguistics extracts patterns from text to help us gain an understanding of the governing rules and interconnectedness of language.\u00a0 Her work also looks at how electronic texts and analysis software are &#8220;being utilized by researchers in a range of diverse areas in the arts and humanities and in the social sciences.&#8221;\u00a0\u00a0\u00a0 It&#8217;s hard to believe that this area of study was first conducted before computers were around.\u00a0 Corpus linguistics seeks to bring order to and make sense of the breadth of information and diverse use of language available to us.<\/p>\n<p>One of the methods Adolphs&#8217; presents us with is concordance data.\u00a0 The Key Word in Context (KWIC) concordance takes a body of text and examines certain words and phrases.\u00a0 &#8220;A concordance is a way of presenting language data to facilitate analysis&#8221; (Adolphs, 5).\u00a0 However, language is complex because you can have multiple meanings for certain words and those meanings change over time.\u00a0 The output format in a KWIC concordance helps us to analyze a word in context.\u00a0 This greatly affects how we interpret historical texts like the Bible, Shakespeare, and even the <a href=\"http:\/\/http:\/\/law2.umkc.edu\/faculty\/projects\/ftrials\/conlaw\/interp.html\">US Constitution<\/a>.<\/p>\n<div id=\"attachment_78\" style=\"width: 250px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/blogs.scu.edu\/anouaux\/files\/2013\/02\/300px-Writing_the_Declaration_of_Independence_1776_cph.3g099041.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-78\" class=\" wp-image-78 \" alt=\"300px-Writing_the_Declaration_of_Independence_1776_cph.3g099041\" src=\"http:\/\/blogs.scu.edu\/anouaux\/files\/2013\/02\/300px-Writing_the_Declaration_of_Independence_1776_cph.3g099041.jpg\" width=\"240\" height=\"320\" srcset=\"https:\/\/blogs.scu.edu\/anouaux\/files\/2013\/02\/300px-Writing_the_Declaration_of_Independence_1776_cph.3g099041.jpg 300w, https:\/\/blogs.scu.edu\/anouaux\/files\/2013\/02\/300px-Writing_the_Declaration_of_Independence_1776_cph.3g099041-225x300.jpg 225w\" sizes=\"auto, (max-width: 240px) 100vw, 240px\" \/><\/a><p id=\"caption-attachment-78\" class=\"wp-caption-text\">The original intent\/meaning of the US Constitution has been hotly debated in our Supreme Court.<\/p><\/div>\n<p>Researchers greatly benefit from existing corpora such as the <em>Bank of English<\/em> corpus, &#8220;which exceeds 500 million words at the time of writing&#8221; (Adolphs, 18).\u00a0 An extensive body of corpus grants more possibilities to researchers.\u00a0 In general, drawing from a larger body of data is more scientific and adds &#8220;to the robustness of the analytical results&#8221; (Adolphs, 19).\u00a0 Various qualitative and quantitative methods have been devised to &#8220;provide a way into the data that is informed by the data itself&#8221; (19).<\/p>\n<p>The type-token ratio examines lexical density and &#8220;can be useful when assessing the level of complexity of a particular text or text collection&#8221; (Adolphs, 39-40).\u00a0 While this method gives us a basic insight to a text and maybe helpful in organizing, a closer examination of the words and phrases is required to make any sort of concrete evaluation of the complexity of the text.<\/p>\n<p>Examining the words used in text or even spoken conversation can yield invaluable information to those in research as well as professional fields.\u00a0 Studying wordlists reveal the frequency of certain key words or phrases used.<\/p>\n<blockquote><p>In political science it may be the comparison of linguistic devices used by different political parties, for example in the context of election campaign discourse.<\/p><\/blockquote>\n<p>The frequency of words observed in wordlists are expressed as ratios within the body of a text.\u00a0 This is useful because texts vary in size, and ratios &#8220;provide a better basis for comparison of frequencies of individual items&#8221; (Adolphs, 43).\u00a0 The CANCODE corpora represents a list of general spoken English.\u00a0 In Adolphs&#8217; comparison of the corpora of Health Professional (HP) and the CANCODE corpus the frequency of positive keywords reveal that the HP corpora are more geared towards speaking in the present tense.\u00a0 This comparison to the CANCODE corpora can help us form hypothesis about the nature of the health profession based on its lexicon.<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Svenja Adolphs, a Lecturer in Applied Linguistics at the University of Nottingham, provides a guide to the area of corpus linguistics in her book\u00a0Introducing Electronic Text Analysis. \u00a0 Corpus linguistics extracts patterns from text to help us gain an understanding &hellip; <a href=\"https:\/\/blogs.scu.edu\/anouaux\/2013\/02\/01\/corpus-linguistics\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":401,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"qubely_global_settings":"","qubely_interactions":"","kk_blocks_editor_width":"","_kiokenblocks_attr":"","_kiokenblocks_dimensions":"","footnotes":""},"categories":[1],"tags":[],"class_list":["post-74","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"gutentor_comment":6,"qubely_featured_image_url":null,"qubely_author":{"display_name":"anouaux","author_link":"https:\/\/blogs.scu.edu\/anouaux\/author\/anouaux\/"},"qubely_comment":6,"qubely_category":"<a href=\"https:\/\/blogs.scu.edu\/anouaux\/category\/uncategorized\/\" rel=\"category tag\">Engl 16 - Blog Posts<\/a>","qubely_excerpt":"Svenja Adolphs, a Lecturer in Applied Linguistics at the University of Nottingham, provides a guide to the area of corpus linguistics in her book\u00a0Introducing Electronic Text Analysis. \u00a0 Corpus linguistics extracts patterns from text to help us gain an understanding &hellip; Continue reading &rarr;","post_mailing_queue_ids":[],"_links":{"self":[{"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/posts\/74","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/users\/401"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/comments?post=74"}],"version-history":[{"count":6,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/posts\/74\/revisions"}],"predecessor-version":[{"id":76,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/posts\/74\/revisions\/76"}],"wp:attachment":[{"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/media?parent=74"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/categories?post=74"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.scu.edu\/anouaux\/wp-json\/wp\/v2\/tags?post=74"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}