52 Comments
Brian Villanueva:

I teach robotics to HS students. "Manipulation is the problem" is something we talk about quite a bit. I show them the old DARPA challenge videos and they get a laugh at robots trying to just open a door. Robots have gotten better, but simple tasks still baffle them. Why? Tactile sensor density.

Computer vision has gotten very good: high resolution, AI pattern rec, the robot knows what's around it. But manipulation isn't visual. It's tactile, and tactile sensor density simply hasn't kept up. Human tactile resolution in the fingertip is about 1/2mm. And that's not just binary -- "am I touching something?". It's quite complex: "How much pressure?"; "Is it hot or cold?"; "Is it hard or soft?" The human palm is less sensor-dense, but still far denser than any robotic fingertip. This is the biggest limitation of humanoid robots today: they can't feel. (And I don't mean emotions.)
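
To put rough numbers on that gap, here's a back-of-envelope sketch. The ~0.5mm human figure is the resolution mentioned above; the pad area and the robot-side sensor pitch are purely illustrative assumptions:

```python
# Back-of-envelope comparison of tactile sensing element ("taxel") counts.
# All figures are illustrative assumptions, not measured specs.

FINGERTIP_PAD_MM2 = 200.0  # ~2 cm^2 of fingertip pad, in mm^2 (assumed)

def taxel_count(area_mm2: float, pitch_mm: float) -> int:
    """Number of sensing elements on a square grid with the given pitch."""
    return round(area_mm2 / pitch_mm ** 2)

human = taxel_count(FINGERTIP_PAD_MM2, pitch_mm=0.5)  # ~0.5 mm two-point resolution
robot = taxel_count(FINGERTIP_PAD_MM2, pitch_mm=5.0)  # assumed pitch for a robot pad

print(f"human fingertip: ~{human} taxels")  # ~800, each multi-channel (pressure, heat, ...)
print(f"robot fingertip: ~{robot} taxels")  # ~8, often pressure-only
print(f"density ratio:   ~{human // robot}x")
```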

You're also correct that dexterity and strength are largely a tradeoff and probably always will be. Strength requires higher-power servos, but their larger size limits how many can be packed into something small like a finger. Also, the higher pressures of lifting heavy things tend to damage the tactile sensors needed for more dexterous applications. This is likely unsolvable, but it won't matter long-term: robots will become cheap enough that they will be specialized. The unit that can crack eggs for an omelet doesn't need to lift 50 pounds.

Vision is there. AI is there. Servos are close. But tactile sensors are the biggest hurdle. There's lots of folks working on this, and it's going to get there. But it's not there yet.

Russell Hawkins:

Is vision actually "there" though? I was reading recently about the surprising persistence of human radiologists, and I was struck by how much the problems of candidate AI replacements sounded like the ones they had in the early 2010s: issues like not being able to handle small differences in images from different machines, and accidentally overfitting to features of the training images that aren't part of the actual scan.

Also, everyone in self-driving cars (except Tesla) is relying heavily on lidar.

I'm not sure how these facts fit in with the obviously massive improvements in object categorization and facial recognition, but I wonder if the visual aspect of dexterous manipulation might also still be a huge challenge.

Brian Villanueva:

My understanding is that computer vision is there. And that makes sense, since it's really fairly basic pattern rec. It may sound weird to non-geeks to say "computer vision is easy," but the great AI renaissance of the last 10 years has been in pattern rec. Failures of vision today come down to training data that's either too limited or too weak. I still think the radiologists are doomed.

Lidar is used because it's faster and more reliable at distance measurements than binocular computer vision, even if the eyeballs are on opposite sides of the car.
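
One concrete reason for that: under the standard pinhole stereo model, depth error grows roughly with the square of range, while lidar time-of-flight error stays roughly constant. A quick sketch (the focal length, baseline, and matching error are all assumed, illustrative numbers):

```python
# Stereo depth: Z = f * B / d  (depth = focal length * baseline / disparity).
# Differentiating gives dZ ~ (Z^2 / (f * B)) * dd, so error grows with Z^2.
# All numbers below are illustrative assumptions.

f_px = 1000.0       # focal length in pixels (assumed)
baseline_m = 1.5    # "eyeballs on opposite sides of the car" (assumed)
disp_err_px = 0.25  # sub-pixel disparity matching error (assumed)

for z_m in (10, 50, 100, 200):
    err_m = z_m ** 2 * disp_err_px / (f_px * baseline_m)
    print(f"range {z_m:>3} m -> stereo depth error ~{err_m:.2f} m")
# 10 m -> ~0.02 m, but 200 m -> ~6.7 m; lidar stays at a few cm throughout.
```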

Danila Medvedev:

A good illustration of why it’s difficult to manipulate something is the difficulty of creating realistic doors in computer games. See https://www.youtube.com/watch?v=AYEWsLdLmcc for example.

Gary Mindlin Miguel:

If there are tasks that can be done by a human operating a robot, that suggests those are not blocked on any hardware.

Ryan Davidson:

It's not just tactile sensor efficacy and density that's the issue. It's proprioception, or kinesthesia: having a "sense" of position and movement, both of one's own body and of the surrounding environment, without the use of visual or auditory inputs. Another commenter mentioned having an "almost visual image" of the task of buttoning a shirt. That's what we're talking about here.

This is vitally important to human (or, really, animal) dexterity. It's not enough to just have the tactile input. Or, rather, inputs, because as a different commenter mentioned, our sense of touch is multi-channel in terms of both the number and the types of inputs, all of which are analog, not binary. Pick up a pencil: the sensation you experience is an integration of signals sent by potentially thousands of different nerve endings. Those signals are integrated in both time and space, with each signal interpreted in relation to the others. That's how you can know that a ball is round, for instance: by integrating the sensation from your entire hand while taking into account the position of each finger in relation to the whole.
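
To make the "integration across the whole hand" point concrete, here's a minimal sketch of just the spatial part: recovering that a grasped object is a ball, and how big, by least-squares fitting a sphere to fingertip contact positions. The contact data and noise level are simulated assumptions; real tactile fusion also integrates pressure, shear, and timing.

```python
import numpy as np

def fit_sphere(points: np.ndarray):
    """Least-squares sphere fit to 3D contact points (rows of `points`).

    Uses the linear form |p|^2 = 2*c.p + (r^2 - |c|^2): solve for the
    center c and the combined constant, then recover the radius r."""
    A = np.hstack([2 * points, np.ones((len(points), 1))])
    b = (points ** 2).sum(axis=1)
    w, *_ = np.linalg.lstsq(A, b, rcond=None)
    center = w[:3]
    radius = float(np.sqrt(w[3] + center @ center))
    return center, radius

# Eight simulated contacts on a ball of radius 30 mm, with 0.5 mm position
# noise (the human two-point resolution mentioned in an earlier comment).
rng = np.random.default_rng(0)
dirs = rng.normal(size=(8, 3))
dirs /= np.linalg.norm(dirs, axis=1, keepdims=True)
contacts = 30.0 * dirs + rng.normal(scale=0.5, size=(8, 3))

center, radius = fit_sphere(contacts)
print(f"estimated radius ~{radius:.1f} mm")  # close to 30 mm: "the ball is round"
```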

This appears to be a very difficult problem even for biological nervous systems. Humans, and most mammals, appear to have a pretty good sense of proprioception. But mammals are vertebrates. We have rigid internal skeletons. Our limbs may move in relationship to each other, but their dimensions are basically fixed. This means that our nervous system can treat its own dimensions as a constant.

Which is why we're so awkward around puberty, for what it's worth: for a while there, we grow faster than our nervous system has time to account for. We literally outgrow our own feet.

Anyway, this is probably why animals like octopuses don't appear to have much in the way of proprioception. Their legs are boneless. They can change not only the relative position of their legs (independently!) but bend them at every point along their length, as well as change both the diameter and the length of each one. Their nervous systems are pretty damn complex, though clearly not as complex as ours. But unlike mammals, with our rigid limbs, their nervous systems can't take any of their own dimensions as a given. So they basically don't bother with proprioception, as far as we've been able to tell.

Octopuses can get away without proprioception because they're basically all legs, live in the water (meaning their limbs don't have to support their own weight or the weight of things they're trying to manipulate), and are basically infinitely flexible. This means that for an octopus, the answer to "Where are my legs?" is effectively "Everywhere!" They can also squeeze themselves through any hole larger than their eyeballs, which is a pretty neat trick.

Needless to say, we can't build robots that way. But that being the case, we're left trying to replicate an incredibly sophisticated, analog, multi-variate, multi-channel sensory phenomenon with digital, algorithmic brute force.

Luke Lea:

The importance of proprioception seems right. Is that part of the sense of touch or something else entirely?

Ryan Davidson:

I'm no neurologist, but they seem to be connected somehow. Note that you can tell where your hand is even when it isn't touching anything. But without that sense, touching something wouldn't tell you its shape.

Luke Lea:

One datum: when your leg goes to sleep you can't even walk.

Kalen:

I suspect that, as with chatbots, the urge to 'replicate human abilities' is a sci-fi-addled blind alley, distinct from 'do something useful,' that encourages hype and deception and guarantees a degree of foundering on edge cases that eventually reveals the extent of that hype and deception. The core business of so many tech companies is not actually producing functional, saleable products at profit-generating scale; it's generating a permanent sense of futurity that keeps people buying the stock. And if what you need to do is power that dream machine, then having a person-shaped machine mash some laundry, *even if everyone knows this is not likely to be within its real-world capabilities, or your price range, basically forever*, is more important to your business than trying to find and adapt tasks your robots can actually do.

The world is still full of flat floors and square boxes, but getting a robot that could, say, help a mover get boxes off a truck at a sensible price point would mean bringing something kind of boring-looking to market. Whereas if you keep chasing the (grim, every-problem-is-a-nail-for-my-techno-hammer) dream of a robot that can braid old people's hair, it always feels like the future.

Brett:

I remember this issue coming up in an earlier wave of self-driving truck startups that didn't pan out. Some of the essay below, from the founder of the now-defunct Starsky Robotics, feels dated, but he made a good point about how VCs expected founders to lie to them and promise the Moon. In practice, they much preferred to invest in companies that promised all kinds of amazing SF features, even if those companies weren't remotely close to getting them reliable enough to do those things more than "once in a while it might work":

https://medium.com/starsky-robotics-blog/the-end-of-starsky-robotics-acb8a6a8a5f5

Kalen:

It happens in lots of arenas where 'moonshots' are culturally normalized. Here's a study of applicants for grants from the Gates Foundation which found that people making outsized claims didn't achieve anything more than more honest/modest actors did, but did get more money. Sure, that was a one-off thing and not something that explains vast swathes of the world we live in...: https://arxiv.org/ftp/arxiv/papers/0909/0909.4043.pdf

Brian Villanueva:

Replicating humans sounds cool: your private robot servant that can do everything. But it's very difficult and expensive. The more likely path is getting robotics cheap enough that you have several mobile robots, each specialized for specific things. The robot that folds your laundry will be different from the one that teaches your kids and the one that helps pick up Grandma when she falls.

Alien On Earth:

To me the robot looked like Dr. Ock before I went under.

Andy in TX:

This was terrific, as usual. I'd love to see you tackle medical robots sometime. I just had a robotic prostatectomy, and the surgeon's video of a typical procedure that I saw looked like the patient was being attacked by a giant robot spider while the MD played a video game in the corner. The surgery was a breeze compared to non-robotic ones (4 tiny incisions, 1 slightly larger one), and the team consisted of the MD, a tech to swap stuff on the robot's arms, the anesthetist, and not much else. Similarly, a friend had a robotic knee replacement and had had his other knee replaced the old-fashioned way: no comparison in the rate of recovery, etc. These fairly specialized machines (not sure if a prostate-removal robot can also do other things, but I suspect it can at least remove other stuff, if not do a knee replacement) are really improving medicine, and it doesn't matter that they look like a spider rather than a human. Which makes me think the huge improvements in robotics in the short to medium term are likely to come from specialized robots rather than all-purpose humanoid ones. Digging into that, especially in medicine, would be a great column!

Russell Hawkins:

Surgical robots are a fantastic innovation, but I'm pretty sure at this point they are still 100% teleoperated. So in this sense they aren't at all dexterous on their own, because they never operate on their own.

Luke Lea:

I had a similar experience with major bowel surgery. Twelve inches of large intestine resected with essentially no recovery time.

Geoff Olynyk:

How much of this is because the latest generation of intelligent AIs hasn’t yet been applied to the control systems of humanoid (or industrial) robots?

I am dizzy at how fast it’s proceeding — intelligence of AI systems doubling every 7 months, and we’re realizing that the LLM architecture maybe actually just generalizes to become an AGI. Its “neurons” don’t look like ours do, but they’re beginning to be creative and solve problems at the level of top humans. I don’t pay for ChatGPT so I don’t have access to OpenAI’s latest model (o4-mini-high) but I hear it’s awe-inspiring on what it can do.

In my former field (tokamak fusion research), AIs are now controlling the plasma actuators to suppress edge-localized-modes that leak heat out of the plasma, and there are high hopes that AI control can actually suppress major disruptions. A humanoid robot control system is probably similar in complexity to a fusion-grade plasma.

Sam:

Some researchers create models and robots that use deep learning or neural networks to do things like pick up objects, fold clothes, untangle ropes, etc., but the models, like the plasma controllers, are specific to their tasks and don't generalize to being good at manipulating everything physical.

https://vcresearch.berkeley.edu/news/ken-goldberg-wins-multiple-best-paper-awards

K Brown:

You had me at "In my former field (tokamak fusion research)" :)

mike harper:

Funny: as I was buttoning my shirt, I thought about how the sensation of my fingers manipulating the button gave me an almost visual brain image of the task. The tiny brain also thought: I wonder if a robot could do that? Would it need as many sensors as are in my hands and arms? I did not have to see the shirt or the button.

Add #22: Buttoning a shirt.

Jim:

I have an aging relative who would tell you that's the main thing she needs a robot for.

mike harper:

I don't need the robot yet but soon. #90 coming in Nov. Hoping to check out before someone has to wipe my ass.

Jim:

Yeah, that's an even better one.

Doctor Hammer:

I don’t want to be in the volunteer test group for that one.

“Damn, that’s the third one this morning. We really need to lower the torque or we will have a hell of a lawsuit.”

Jim:

This is reminding me that there's actually good news here. People like the Toto Washlet, which is a toilet seat with a spray washer attached, and a dryer. And supposedly means you don't need to wipe.

Importantly, the Washlet can only move along one axis, and it's not up.

Elan Barenholtz, Ph.D.:

“Manipulation is the hard problem we need to solve to make humanoid robots useful, not locomotion.”

Interesting. I guess in cases where locomotion is the primary goal—like transporting stuff or operating weapons—you may not need humanoid robots; quadrupeds or other designs may be better models. But domestic and workplace environments designed for the human form require human-like dexterity.

Mark Brophy:

If locomotion is your goal, an ostrich is better than a human because it didn't descend from an animal that lives in trees.

rahul razdan:

Nice article.

I was having a long conversation with a robot expert (I am more of a computing expert), and he explained the problem as: we have nothing that comes close to the properties of a muscle. A muscle is amazing because it provides power but, in the "limp" state, gets out of the way. This allows an incredible range of motion. Very interesting...

Paul Drake:

In my view it's the sensors. By your numbers, the human hand has nearly 100 times more sensors than the best robot hand, and the human ones are far more capable. That is a massive difference.

Geoff Olynyk:

Three different comments here now saying it’s the sensor density in the effectors (“fingers” and “hands”).

Feels like we have a consensus :)

Kevin:

An eval is not so useful when the state of the art is, well, completely unable to score above zero. A more useful approach might be a standard task measured on continuous performance.

For example, how fast can a pair of robotic hands fold an origami paper crane? It’s very replicable because you just need a standard square of origami paper. And there is *some* performance that you could have today, it’s not just completely impossible.
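
A sketch of what that continuous scoring could look like; the stages, weights, and time limit below are invented for illustration:

```python
# Continuous dexterity score for the crane task: partial credit per stage
# reached, plus a speed bonus on completion, so nobody is stuck at zero.
# Stage names, weights, and the time limit are made up for illustration.

CRANE_STAGES = [
    ("square base", 0.2),
    ("bird base", 0.3),
    ("neck and tail folds", 0.3),
    ("finished crane", 0.2),
]
TIME_LIMIT_S = 600.0

def score(stages_completed: int, elapsed_s: float) -> float:
    """Weight of stages reached, plus up to +1.0 for finishing quickly."""
    progress = sum(w for _, w in CRANE_STAGES[:stages_completed])
    if stages_completed == len(CRANE_STAGES):
        progress += max(0.0, 1.0 - elapsed_s / TIME_LIMIT_S)
    return progress

print(score(1, 600.0))  # only got a square base: 0.2
print(score(4, 300.0))  # finished in 5 minutes: 1.0 + 0.5 = 1.5
```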

K Brown:

Your first sentence was my thought, but then I had two more. First, I remembered the original DARPA challenge in the desert, where zero vehicles crossed the finish line and a few ended up in ditches. Second: I would have trouble scoring well on a few of these; that "start the first sheet from a new toilet paper roll," performed right after I just woke up, came to mind.

So while I don't think an eval that most state-of-the-art systems would completely fail at is useless (many fields start with exactly these kinds of unachievable goals), I do think the baseline score should be 1) an average human doing these tasks, and 2) an average human doing these tasks after three rapid-fire martinis :)

Jim:

DARPA Dexterity Challenge, sign me up.

Kevin:

The DARPA challenge was a good eval because you could measure how far the robots got. So you would still have: of these ten teams, this one is in first place, congratulations! You could make progress.

An eval that says "everyone in the field is equally bad and scores zero points" is no good, because you want someone to be able to win. Then the winner gets some use out of it, like the PR of saying "the CMU robotics team won first place," even if that just means you drove into a ditch 100 feet further along than where the MIT team drove into theirs.

A good eval needs to have a winner. Ideally a top ten list. People love to read top ten lists.

K Brown:

But what about the martinis? ;)

Brian Villanueva:

I like the paper crane idea; that would be a great addition. But evals are useful even if systems fail at them. Look at the Turing Test or the ARC Prize: systems scored zeros for a very long time. Until they didn't.

K Brown:

And there's this: https://arcprize.org/blog/announcing-arc-agi-2-and-arc-prize-2025

Currently no general LLM foundation model scores higher than zero. But like the original DARPA Grand Challenge, there's a $1M prize, so it will be interesting to see if this drives innovation too.

[sphere]:

Robotics engineer here; I don't work on manipulation currently, but I like to think I'm reasonably familiar with the state of the field. This post is pretty spot-on. A lot of manipulation tasks (in addition to all the other problems) require coordinating two or more arms to do something, which not only doubles the hardware expense, it adds a whole layer of task/motion planning complexity that is an active area of current research.

Re: dexterity evals, you might be interested in the YCB object set and related benchmarks (https://www.ycbbenchmarks.com/), which are fairly standard for manipulation research. You'll notice it's quite light on deformable objects and other such tricky items, and it has still managed to keep roboticists busy for the past decade.

In general, it's always a good idea to assume that a robot video was filmed in very specific, ideal conditions, that you're seeing one of dozens or hundreds of takes, and that it's all teleoperated unless you see hard evidence otherwise.

Yoonseo Kang:

Hardware is a solved problem. Tactile could be better but the real problem is advanced AI software design. This is a brick wall that even the top robot startups are running into, despite all their money. Good news is, robotic AI breakthrough coming this year. Everything is about to change.

Sean White:

Thanks @brianpotter for this balanced and sober interpretation of what is unfolding before us in the world of robotics.

Adam:

Great perspective. As someone who works with industrial robots pretty regularly, I'd say the key obstacle is safety: if something is stronger or faster than a human, it needs to be kept in some type of protective cage or work only with other robots.

Isn't that a huge issue here too? You can't have robots stumbling over babies or knocking old people down, and they should be light enough that you could move one that was impairing your own movement. I don't see how you make a humanoid robot strong enough to be useful while also preventing it from accidentally injuring people pretty regularly.

Ankur Handa:

Hi Brian, I want to highlight our DexPilot work (https://sites.google.com/view/dex-pilot), where we show that you can do a wide variety of things with hands and arms via teleop. This was done in 2019, long before people started taking hands seriously.

I'd argue the major bottleneck isn't hardware so much as software, and I am very bullish on simulation closing the gap in the coming 3-4 years.

Also, your Substack post is really good!

tg56:

I like your eval list. Another one: cracking and separating an egg without breaking the yolk or getting shell in either the whites or the yolk. There are purpose-built machines that can do this (at speed), but they don't look anything like hands.
