TCSE is a search engine specializing in exploring transcripts of TED Talk. It has been created for educational and scientific purposes. TCSE uses data provided by TED under the Creative Commons BY-NC-ND license, but it is not an official service of TED.
Change Log | Disclaimer | Documentation
TCSE is created by Yoichiro Hasebe at Doshisha University, Kyoto, Japan and made available free for non-commercial educational and scientific use. Please cite one of the following when you publish work which utilizes TCSE.
Hasebe, Yoichiro (2015) Design and Implementation of an Online Corpus of Presentation Transcripts of TED Talks. Procedia: Social and Behavioral Sciences 198(24), 174–182.
TCSE Version | 10.0.3 |
Date of talk data compilation | July 30, 2023 |
English POS-Tagger / Syntactic Parser | spaCy 3.6 |
Number of talks | 5,133 |
Number of segments | 1,131,206 |
Number of expanded segments | 529,044 |
Number of elements | 10,269,950 |
Number of lexical items | 97,643 |
Arabic | 5,049 talks |
Bulgarian | 2,271 talks |
Burmese | 1,494 talks |
Chinese, Simplified | 4,979 talks |
Chinese, Traditional | 4,849 talks |
Croatian | 2,022 talks |
Czech | 1,706 talks |
Dutch | 3,042 talks |
French | 4,898 talks |
German | 3,112 talks |
Greek | 3,119 talks |
Hebrew | 3,997 talks |
Hindi | 891 talks |
Hungarian | 3,456 talks |
Indonesian | 2,960 talks |
Italian | 4,422 talks |
Japanese | 4,215 talks |
Korean | 4,824 talks |
Kurdish | 1,239 talks |
Northern Kurdish | 1,114 talks |
Persian | 3,758 talks |
Polish | 3,573 talks |
Portuguese | 4,472 talks |
Portuguese, Brazilian | 4,811 talks |
Romanian | 3,634 talks |
Russian | 4,506 talks |
Serbian | 2,843 talks |
Slovak | 1,107 talks |
Spanish | 5,083 talks |
Swedish | 1,259 talks |
Thai | 2,133 talks |
Turkish | 4,562 talks |
Ukrainian | 2,224 talks |
Vietnamese | 4,310 talks |
How to skip to a specific segment
How to adjust sync between video and transcript
Sometimes video and transcript are not in sync for some reason. For such cases, the following solution is available on TCSE:
Advanced search is available only in English.
POS keys are specified either fully ({vb}
) or partially ({v}
).
An advanced search query string cannot consist only of POS keys.
Lemma | [LEMMA] |
Part of Speech | {POS} |
Surface + Part of Speech | SURFACE{POS} (with no spaces in-between) |
Lemma + Part of Speech | [LEMMA]{POS} (with no spaces in-between) |
Logical Disjunction (OR) | A|B |
Segment Onset (Beginning) | ^ |
Noun Chunk | _ |
Negative Match | -X |
Wild Card (matching exactly one element/word) | -_ |
Wild Card (matching variable length of strings) | * |
[excite] |
excite, excites, excited, exciting |
{noun} |
Noun, any kind |
{verb} |
Verb, any kind |
to * surprise |
to our surprise to his surprise, etc. |
[read] {det} [news|paper|article] |
they read these articles reading the paper or something I'm reading the news at six, etc. |
^ having {verb} |
Having started the process, Having said that, etc. |
[help]{noun} |
an aunt offered financial help, we called people for help, etc. |
[get] -rid of |
get outside of get ahead of got tired of, etc. |
[make] _ -_ |
made a bad design good. make this happen. make your life miserable., etc. |
[give] _ _ |
give you an example gave her a gift give the government any further excuse, etc. |