Yoichiro Hasebe (長谷部 陽一郎) is an associate professor at the Faculty of Global Communications, Doshisha University (同志社大学). His research interests include cognitive linguistics, comparative linguistics, and corpus linguistics. He is also an avid web application programmer and works on several projects to develop online systems that aid language educators and learners.
See also: https://github.com/yohasebe
A versatile corpus system to retrieve video/text segments from over 2,300 TED Talks
RSyntaxTree is a graphical syntax tree generator written in the Ruby programming language.
WP2TXT extracts plain text from Wikipedia dump files, stripping all MediaWiki markup and other metadata.
EngTagger is a probability-based, corpus-trained tagger that assigns POS tags to English text based on a lookup dictionary and a set of probability values.
A tutorial on how to use COCA and other BYU corpora [in Japanese]
at Doshisha University