TCSE is a search engine specializing in exploring transcripts of TED Talk. It has been created for educational and scientific purposes. TCSE uses data provided by TED under the Creative Commons BY-NC-ND license, but it is not an official service of TED.
Change Log | Disclaimer | Documentation
TCSE is created by Yoichiro Hasebe at Doshisha University, Kyoto, Japan and made available free for non-commercial educational and scientific use. Please cite one of the following when you publish work which utilizes TCSE.
Hasebe, Yoichiro (2015) Design and Implementation of an Online Corpus of Presentation Transcripts of TED Talks. Procedia: Social and Behavioral Sciences 198(24), 174–182.
TCSE Version | 10.0.4 |
Date of talk data compilation | December 24, 2023 |
English POS-Tagger / Syntactic Parser | spaCy 3.72 |
Number of talks | 5,305 |
Number of segments | 1,164,910 |
Number of expanded segments | 546,023 |
Number of elements | 10,580,358 |
Number of lexical items | 98,922 |
Arabic | 5,230 talks |
Bulgarian | 2,281 talks |
Burmese | 1,586 talks |
Chinese, Simplified | 5,141 talks |
Chinese, Traditional | 4,897 talks |
Croatian | 2,023 talks |
Czech | 1,733 talks |
Dutch | 3,091 talks |
French | 5,130 talks |
German | 3,202 talks |
Greek | 3,140 talks |
Hebrew | 4,055 talks |
Hindi | 1,011 talks |
Hungarian | 3,479 talks |
Indonesian | 3,123 talks |
Italian | 4,641 talks |
Japanese | 4,299 talks |
Korean | 4,943 talks |
Kurdish | 1,288 talks |
Northern Kurdish | 1,116 talks |
Persian | 3,831 talks |
Polish | 3,624 talks |
Portuguese | 4,597 talks |
Portuguese, Brazilian | 4,859 talks |
Romanian | 3,668 talks |
Russian | 4,586 talks |
Serbian | 2,865 talks |
Slovak | 1,107 talks |
Spanish | 5,253 talks |
Swedish | 1,295 talks |
Thai | 2,175 talks |
Turkish | 4,673 talks |
Ukrainian | 2,258 talks |
Vietnamese | 4,618 talks |
How to skip to a specific segment
How to adjust sync between video and transcript
Sometimes video and transcript are not in sync for some reason. For such cases, the following solution is available on TCSE:
Advanced search is available only in English.
POS keys are specified either fully ({vb}
) or partially ({v}
).
An advanced search query string cannot consist only of POS keys.
Lemma | [LEMMA] |
Part of Speech | {POS} |
Surface + Part of Speech | SURFACE{POS} (with no spaces in-between) |
Lemma + Part of Speech | [LEMMA]{POS} (with no spaces in-between) |
Logical Disjunction (OR) | A|B |
Segment Onset (Beginning) | ^ |
Noun Chunk | _ |
Negative Match | -X |
Wild Card (matching exactly one element/word) | -_ |
Wild Card (matching variable length of strings) | * |
[excite] |
excite, excites, excited, exciting |
{noun} |
Noun, any kind |
{verb} |
Verb, any kind |
to * surprise |
to our surprise to his surprise, etc. |
[read] {det} [news|paper|article] |
they read these articles reading the paper or something I'm reading the news at six, etc. |
^ having {verb} |
Having started the process, Having said that, etc. |
[help]{noun} |
an aunt offered financial help, we called people for help, etc. |
[get] -rid of |
get outside of get ahead of got tired of, etc. |
[make] _ -_ |
made a bad design good. make this happen. make your life miserable., etc. |
[give] _ _ |
give you an example gave her a gift give the government any further excuse, etc. |