TCSE: Ted Corpus Search Engine

Recorded at	April 15, 2019
Event	TED2019
Duration (min:sec)	12:21
Video Type	TED Stage Talk
Words per minute	159.51 slow
Readability (FK)	68.06 very easy
Speaker	Doug Roble


1	00:13	Hello.
2	00:15	I'm not a real person.
3	00:17	I'm actually a copy of a real person.
4	00:19	Although, I feel like a real person.
5	00:22	It's kind of hard to explain.
6	00:24	Hold on -- I think I saw a real person ... there's one.
7	00:28	Let's bring him onstage.
8	00:33	Hello.
9	00:35	(Applause)
10	00:40	What you see up there is a digital human.
11	00:43	I'm wearing an inertial motion capture suit
12	00:46	that's figuring what my body is doing.
13	00:49	And I've got a single camera here that's watching my face
14	00:53	and feeding some machine-learning software that's taking my expressions,
15	00:58	like, "Hm, hm, hm,"
16	01:02	and transferring it to that guy.
17	01:05	We call him "DigiDoug."
18	01:09	He's actually a 3-D character that I'm controlling live in real time.
19	01:16	So, I work in visual effects.
20	01:19	And in visual effects,
21	01:20	one of the hardest things to do is to create believable, digital humans
22	01:26	that the audience accepts as real.
23	01:28	People are just really good at recognizing other people.
24	01:32	Go figure!
25	01:35	So, that's OK, we like a challenge.
26	01:39	Over the last 15 years,
27	01:40	we've been putting humans and creatures into film
28	01:45	that you accept as real.
29	01:48	If they're happy, you should feel happy.
30	01:51	And if they feel pain, you should empathize with them.
31	01:58	We're getting pretty good at it, too.
32	02:00	But it's really, really difficult.
33	02:03	Effects like these take thousands of hours
34	02:07	and hundreds of really talented artists.
35	02:10	But things have changed.
36	02:13	Over the last five years,
37	02:14	computers and graphics cards have gotten seriously fast.
38	02:20	And machine learning, deep learning, has happened.
39	02:25	So we asked ourselves:
40	02:27	Do you suppose we could create a photo-realistic human,
41	02:31	like we're doing for film,
42	02:33	but where you're seeing the actual emotions and the details
43	02:39	of the person who's controlling the digital human
44	02:43	in real time?
45	02:45	In fact, that's our goal:
46	02:47	If you were having a conversation with DigiDoug
47	02:51	one-on-one,
48	02:53	is it real enough so that you could tell whether or not I was lying to you?
49	02:59	So that was our goal.
50	03:02	About a year and a half ago, we set off to achieve this goal.
51	03:06	What I'm going to do now is take you basically on a little bit of a journey
52	03:10	to see exactly what we had to do to get where we are.
53	03:15	We had to capture an enormous amount of data.
54	03:20	In fact, by the end of this thing,
55	03:23	we had probably one of the largest facial data sets on the planet.
56	03:28	Of my face.
57	03:29	(Laughter)
58	03:32	Why me?
59	03:33	Well, I'll do just about anything for science.
60	03:36	I mean, look at me!
61	03:38	I mean, come on.
62	03:43	We had to first figure out what my face actually looked like.
63	03:49	Not just a photograph or a 3-D scan,
64	03:52	but what it actually looked like in any photograph,
65	03:56	how light interacts with my skin.
66	03:59	Luckily for us, about three blocks away from our Los Angeles studio
67	04:05	is this place called ICT.
68	04:07	They're a research lab
69	04:09	that's associated with the University of Southern California.
70	04:12	They have a device there, it's called the "light stage."
71	04:16	It has a zillion individually controlled lights
72	04:20	and a whole bunch of cameras.
73	04:22	And with that, we can reconstruct my face under a myriad of lighting conditions.
74	04:29	We even captured the blood flow
75	04:31	and how my face changes when I make expressions.
76	04:35	This let us build a model of my face that, quite frankly, is just amazing.
77	04:41	It's got an unfortunate level of detail, unfortunately.
78	04:45	(Laughter)
79	04:47	You can see every pore, every wrinkle.
80	04:50	But we had to have that.
81	04:52	Reality is all about detail.
82	04:55	And without it, you miss it.
83	04:58	We are far from done, though.
84	05:01	This let us build a model of my face that looked like me.
85	05:05	But it didn't really move like me.
86	05:08	And that's where machine learning comes in.
87	05:11	And machine learning needs a ton of data.
88	05:15	So I sat down in front of some high-resolution motion-capturing device.
89	05:20	And also, we did this traditional motion capture with markers.
90	05:25	We created a whole bunch of images of my face
91	05:28	and moving point clouds that represented that shapes of my face.
92	05:33	Man, I made a lot of expressions,
93	05:36	I said different lines in different emotional states ...
94	05:40	We had to do a lot of capture with this.
95	05:43	Once we had this enormous amount of data,
96	05:46	we built and trained deep neural networks.
97	05:51	And when we were finished with that,
98	05:52	in 16 milliseconds,
99	05:55	the neural network can look at my image
100	05:58	and figure out everything about my face.
101	06:02	It can compute my expression, my wrinkles, my blood flow --
102	06:07	even how my eyelashes move.
103	06:10	This is then rendered and displayed up there
104	06:13	with all the detail that we captured previously.
105	06:18	We're far from done.
106	06:20	This is very much a work in progress.
107	06:22	This is actually the first time we've shown it outside of our company.
108	06:25	And, you know, it doesn't look as convincing as we want;
109	06:29	I've got wires coming out of the back of me,
110	06:32	and there's a sixth-of-a-second delay
111	06:34	between when we capture the video and we display it up there.
112	06:38	Sixth of a second -- that's crazy good!
113	06:41	But it's still why you're hearing a bit of an echo and stuff.
114	06:46	And you know, this machine learning stuff is brand-new to us,
115	06:50	sometimes it's hard to convince to do the right thing, you know?
116	06:54	It goes a little sideways.
117	06:56	(Laughter)
118	06:59	But why did we do this?
119	07:03	Well, there's two reasons, really.
120	07:05	First of all, it is just crazy cool.
121	07:08	(Laughter)
122	07:09	How cool is it?
123	07:10	Well, with the push of a button,
124	07:13	I can deliver this talk as a completely different character.
125	07:17	This is Elbor.
126	07:22	We put him together to test how this would work
127	07:24	with a different appearance.
128	07:27	And the cool thing about this technology is that, while I've changed my character,
129	07:32	the performance is still all me.
130	07:35	I tend to talk out of the right side of my mouth;
131	07:38	so does Elbor.
132	07:39	(Laughter)
133	07:42	Now, the second reason we did this, and you can imagine,
134	07:44	is this is going to be great for film.
135	07:47	This is a brand-new, exciting tool
136	07:49	for artists and directors and storytellers.
137	07:55	It's pretty obvious, right?
138	07:56	I mean, this is going to be really neat to have.
139	07:59	But also, now that we've built it,
140	08:01	it's clear that this is going to go way beyond film.
141	08:05	But wait.
142	08:07	Didn't I just change my identity with the push of a button?
143	08:11	Isn't this like "deepfake" and face-swapping
144	08:14	that you guys may have heard of?
145	08:17	Well, yeah.
146	08:19	In fact, we are using some of the same technology
147	08:22	that deepfake is using.
148	08:23	Deepfake is 2-D and image based, while ours is full 3-D
149	08:28	and way more powerful.
150	08:31	But they're very related.
151	08:33	And now I can hear you thinking,
152	08:35	"Darn it!
153	08:36	I though I could at least trust and believe in video.
154	08:40	If it was live video, didn't it have to be true?"
155	08:44	Well, we know that's not really the case, right?
156	08:48	Even without this, there are simple tricks that you can do with video
157	08:52	like how you frame a shot
158	08:55	that can make it really misrepresent what's actually going on.
159	09:00	And I've been working in visual effects for a long time,
160	09:03	and I've known for a long time
161	09:05	that with enough effort, we can fool anyone about anything.
162	09:11	What this stuff and deepfake is doing
163	09:13	is making it easier and more accessible to manipulate video,
164	09:18	just like Photoshop did for manipulating images, some time ago.
165	09:25	I prefer to think about
166	09:26	how this technology could bring humanity to other technology
167	09:31	and bring us all closer together.
168	09:34	Now that you've seen this,
169	09:36	think about the possibilities.
170	09:39	Right off the bat, you're going to see it in live events and concerts, like this.
171	09:45	Digital celebrities, especially with new projection technology,
172	09:50	are going to be just like the movies, but alive and in real time.
173	09:55	And new forms of communication are coming.
174	09:59	You can already interact with DigiDoug in VR.
175	10:03	And it is eye-opening.
176	10:05	It's just like you and I are in the same room,
177	10:09	even though we may be miles apart.
178	10:12	Heck, the next time you make a video call,
179	10:15	you will be able to choose the version of you
180	10:18	you want people to see.
181	10:20	It's like really, really good makeup.
182	10:24	I was scanned about a year and a half ago.
183	10:29	I've aged.
184	10:30	DigiDoug hasn't.
185	10:32	On video calls, I never have to grow old.
186	10:38	And as you can imagine, this is going to be used
187	10:41	to give virtual assistants a body and a face.
188	10:44	A humanity.
189	10:45	I already love it that when I talk to virtual assistants,
190	10:48	they answer back in a soothing, humanlike voice.
191	10:51	Now they'll have a face.
192	10:53	And you'll get all the nonverbal cues that make communication so much easier.
193	11:00	It's going to be really nice.
194	11:01	You'll be able to tell when a virtual assistant is busy or confused
195	11:05	or concerned about something.
196	11:09	Now, I couldn't leave the stage
197	11:12	without you actually being able to see my real face,
198	11:14	so you can do some comparison.
199	11:18	So let me take off my helmet here.
200	11:20	Yeah, don't worry, it looks way worse than it feels.
201	11:25	(Laughter)
202	11:29	So this is where we are.
203	11:30	Let me put this back on here.
204	11:32	(Laughter)
205	11:35	Doink!
206	11:37	So this is where we are.
207	11:39	We're on the cusp of being able to interact with digital humans
208	11:43	that are strikingly real,
209	11:45	whether they're being controlled by a person or a machine.
210	11:49	And like all new technology these days,
211	11:54	it's going to come with some serious and real concerns
212	11:59	that we have to deal with.
213	12:02	But I am just so really excited
214	12:04	about the ability to bring something that I've seen only in science fiction
215	12:09	for my entire life
216	12:11	into reality.
217	12:13	Communicating with computers will be like talking to a friend.
218	12:18	And talking to faraway friends
219	12:20	will be like sitting with them together in the same room.
220	12:24	Thank you very much.
221	12:26	(Applause)

Doug Roble: Digital humans that look just like us