TCSE is a search engine specializing in exploring transcripts of TED Talks. It has been created for educational and scientific purposes. TCSE uses data provided by TED under the Creative Commons BY-NC-ND license, but it is not an official service of TED.
TCSE was created by Yoichiro Hasebe at Doshisha University, Kyoto, Japan, and is made available free of charge for non-commercial educational and scientific use. Please cite one of the following when you publish work that uses TCSE.
Hasebe, Yoichiro (2015) Design and Implementation of an Online Corpus of Presentation Transcripts of TED Talks. Procedia: Social and Behavioral Sciences 198(24), 174–182.
TCSE Version | 10.0.2 |
Date of talk data compilation | December 24, 2022 |
English POS-Tagger / Syntactic Parser | spaCy 3.4 |
Number of talks | 4,938 |
Number of segments | 1,092,835 |
Number of expanded segments | 507,219 |
Number of elements | 9,906,358 |
Number of lexical items | 95,844 |
Arabic | 4,842 talks |
Bulgarian | 2,226 talks |
Burmese | 1,411 talks |
Chinese, Simplified | 4,714 talks |
Chinese, Traditional | 4,593 talks |
Croatian | 2,002 talks |
Czech | 1,665 talks |
Dutch | 2,975 talks |
French | 4,681 talks |
German | 2,968 talks |
Greek | 3,009 talks |
Hebrew | 3,841 talks |
Hindi | 801 talks |
Hungarian | 3,374 talks |
Indonesian | 2,619 talks |
Italian | 4,252 talks |
Japanese | 4,102 talks |
Korean | 4,491 talks |
Kurdish | 1,209 talks |
Northern Kurdish | 1,111 talks |
Persian | 3,612 talks |
Polish | 3,417 talks |
Portuguese | 4,237 talks |
Portuguese, Brazilian | 4,614 talks |
Romanian | 3,510 talks |
Russian | 4,284 talks |
Serbian | 2,775 talks |
Slovak | 1,103 talks |
Spanish | 4,853 talks |
Swedish | 1,221 talks |
Thai | 2,038 talks |
Turkish | 4,353 talks |
Ukrainian | 2,206 talks |
Vietnamese | 3,875 talks |
How to skip to a specific segment
How to adjust sync between video and transcript
The video and transcript are occasionally out of sync. For such cases, the following solution is available on TCSE:
Advanced search is available only in English.
POS keys are specified either fully ({vb}) or partially ({v}).
An advanced search query string cannot consist only of POS keys.
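The rule above can be checked on the client side before a query is submitted. The following is only a minimal sketch: `is_pos_only` is a hypothetical helper, not part of TCSE, and it assumes that a "POS key" is any whitespace-separated token of the form `{...}`. How TCSE itself validates queries (for example, whether `^` or `|` affect the rule) is not stated here.

```python
import re

# Assumption: a bare POS key is a token made up solely of {…}, e.g. {vb} or {v}.
POS_KEY = re.compile(r"^\{[^{}\s]+\}$")


def is_pos_only(query: str) -> bool:
    """Return True if every whitespace-separated token is a bare POS key."""
    tokens = query.split()
    # An empty query is not "POS only"; it is simply empty.
    return bool(tokens) and all(POS_KEY.match(t) for t in tokens)


# Queries made up only of POS keys would be rejected under this reading.
print(is_pos_only("{verb} {noun}"))    # True
# Adding a surface form or a lemma makes the query acceptable.
print(is_pos_only("^ having {verb}"))  # False
print(is_pos_only("[help]{noun}"))     # False
```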
Lemma | [LEMMA] |
Part of Speech | {POS} |
Surface + Part of Speech | SURFACE{POS} (with no spaces in-between) |
Lemma + Part of Speech | [LEMMA]{POS} (with no spaces in-between) |
Logical Disjunction (OR) | A|B |
Segment Onset (Beginning) | ^ |
Noun Chunk | _ |
Negative Match | -X |
Wild Card (matching exactly one element/word) | -_ |
Wild Card (matching variable length of strings) | * |
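As an illustration of how the notations in this table combine, the sketch below assembles query strings from small helper functions. All function names (`lemma`, `pos`, `any_of`, `negate`, `sequence`) are hypothetical and introduced only for this example; the code produces query strings and does not contact TCSE.

```python
def lemma(word: str) -> str:
    """Lemma notation: [LEMMA]."""
    return f"[{word}]"


def pos(key: str) -> str:
    """Part-of-speech notation: {POS}."""
    return "{" + key + "}"


def any_of(*alternatives: str) -> str:
    """Logical disjunction: A|B."""
    return "|".join(alternatives)


def negate(item: str) -> str:
    """Negative match: -X."""
    return f"-{item}"


def sequence(*elements: str) -> str:
    """Elements of a query are separated by single spaces."""
    return " ".join(elements)


# Lemma, POS key, and a disjunction inside a lemma.
q1 = sequence(lemma("read"), pos("det"), lemma(any_of("news", "paper", "article")))
print(q1)  # [read] {det} [news|paper|article]

# Lemma + POS is written with no space in between.
q2 = lemma("help") + pos("noun")
print(q2)  # [help]{noun}

# Negative match on a surface form.
q3 = sequence(lemma("get"), negate("rid"), "of")
print(q3)  # [get] -rid of
```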
[excite] | excite, excites, excited, exciting |
{noun} | Noun, any kind |
{verb} | Verb, any kind |
to * surprise | to our surprise, to his surprise, etc. |
[read] {det} [news|paper|article] | they read these articles, reading the paper or something, I'm reading the news at six, etc. |
^ having {verb} | Having started the process, Having said that, etc. |
[help]{noun} | an aunt offered financial help, we called people for help, etc. |
[get] -rid of | get outside of, get ahead of, got tired of, etc. |
[make] _ -_ | made a bad design good, make this happen, make your life miserable, etc. |
[give] _ _ | give you an example, gave her a gift, give the government any further excuse, etc. |