Programming for Corpus Linguistics with Python and Dataframes

Cambridge University Press
SKU:
9781108822589
|
ISBN13:
9781108822589
$26.08
(No reviews yet)
Condition:
New
Usually Ships in 24hrs
Current Stock:
Estimated Delivery by: | Fastest delivery by:
Adding to cart… The item has been added
Buy ebook
This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.


  • | Author: Daniel Keller
  • | Publisher: Cambridge University Press
  • | Publication Date: Jun 20, 2024
  • | Number of Pages: NA pages
  • | Language: English
  • | Binding: Paperback
  • | ISBN-10: 1108822584
  • | ISBN-13: 9781108822589
Author:
Sajjad Adeliyan Tous, James T. Richardson
Publisher:
Cambridge University Press
Publication Date:
Jun 13, 2024
Number of pages:
NA pages
Language:
English
Binding:
Paperback
ISBN-10:
1009460072
ISBN-13:
9781009460071