Show HN: Controlling 3D models with voice and hand gestures
I'm sharing my project to control 3D models with voice commands and hand gestures:
- use voice commands to change interaction mode (drag, rotate, scale, animate)
- use hand gestures to control the 3D model
- drag/drop to import other models (only the glTF format is supported for now)
Created using three.js, MediaPipe, the Web Speech API, Rosebud AI, and Quaternius 3D models
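For the curious, the mode switching boils down to keyword matching on Web Speech API transcripts. A simplified sketch, not the exact repo code ("currentMode" is an illustrative global that the gesture handler would read):

    // Sketch: switch interaction mode when a keyword is heard.
    let currentMode = 'drag';
    const SR = window.SpeechRecognition || window.webkitSpeechRecognition;
    const recognition = new SR();
    recognition.continuous = true;
    recognition.onresult = (event) => {
      const result = event.results[event.results.length - 1];
      const transcript = result[0].transcript.toLowerCase();
      for (const mode of ['drag', 'rotate', 'scale', 'animate']) {
        if (transcript.includes(mode)) currentMode = mode;
      }
    };
    recognition.start();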
GitHub repo: https://github.com/collidingScopes/3d-model-playground
Demo: https://xcancel.com/measure_plan/status/1929900748235550912
I'd love to get your feedback! Thank you
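P.S. The drag/drop import is essentially three.js's GLTFLoader fed from a drop event. A minimal sketch (assumes an existing "scene"; self-contained .glb files work best with a blob URL):

    import { GLTFLoader } from 'three/addons/loaders/GLTFLoader.js';

    const loader = new GLTFLoader();
    window.addEventListener('dragover', (e) => e.preventDefault());
    window.addEventListener('drop', (e) => {
      e.preventDefault();
      const file = e.dataTransfer.files[0];
      if (!file || !/\.(gltf|glb)$/i.test(file.name)) return;
      const url = URL.createObjectURL(file);
      loader.load(url, (gltf) => {
        scene.add(gltf.scene); // "scene" is the existing three.js scene
        URL.revokeObjectURL(url);
      });
    });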
I understand you need your face in the videos for the demos, but I want to mention that you should make sure your system works with the user's hands in their lap. As shown, the user is going to experience "gorilla arm" fatigue very quickly.
Good points, maybe a second camera (a phone?) pointed down at the tabletop would help with that. Then the user could rest their hands in a "normal" position.
Thank you for the feedback!
Here's a quick video demo showing how it works: https://x.com/measure_plan/status/1929900748235550912
https://xcancel.com/measure_plan/status/1929900748235550912 for the tweetless
Very cool! I like the different modes. I've always been fascinated with this space and products like Leap Motion: https://www.youtube.com/watch?v=zXghYjh6Gro
It's a shame that computer vision tech like Leap / EyeToy / Kinect didn't have staying power. So much fun to build with
Slightly on topic: does anyone remember Leap Motion, and is anyone aware of any current support for it? I found an original one in a drawer when I was having a clearout the other day
I have one as well! Sorry, no idea what it can be used for nowadays
Amazing! Maybe use specific finger positions/gestures to trigger the rotation and scale functions (index finger up and within a bounding box of the model, perhaps, for rotation; similarly, pinch two fingers to scale).
I'll try it, thank you! I separated them into completely different interaction modes to avoid misfires, but there's definitely room to make things more efficient
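For reference, a pinch is straightforward to detect from the MediaPipe hand landmarks (thumb tip is index 4, index fingertip is 8). A rough sketch, assuming a configured Hands instance and an illustrative threshold:

    // Sketch: pinch = small distance between thumb tip (4) and index tip (8).
    // "hands" is assumed to be a configured MediaPipe Hands instance.
    hands.onResults((results) => {
      const lm = (results.multiHandLandmarks || [])[0];
      if (!lm) return;
      const dist = Math.hypot(lm[4].x - lm[8].x, lm[4].y - lm[8].y);
      const pinching = dist < 0.05; // threshold in normalized coords, illustrative
      if (pinching) {
        // e.g. map "dist" onto the model's scale while the pinch is held
      }
    });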
I would be surprised if sign language didn't have an efficient way to convey digits.
Awesome, nice work! This type of tech opens up a world of physical games.
I made a demo game where you need to dodge the evil bouncing DVD logo by moving your body: https://x.com/measure_plan/status/1924830500541157570
I'm working on a couple other body movement concepts and hope to share soon :)
See also Webcam Mania: https://webcam.sulat.net/
It uses a somewhat simpler approach, only detecting movement, but it works well enough for such games.
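The core of that movement-only approach is frame differencing on a canvas. A minimal sketch, assuming existing canvas and video elements:

    // Sketch: count pixels that changed since the previous frame.
    const ctx = canvas.getContext('2d', { willReadFrequently: true });
    let prev = null;

    function detectMotion() {
      ctx.drawImage(video, 0, 0, canvas.width, canvas.height);
      const curr = ctx.getImageData(0, 0, canvas.width, canvas.height).data;
      let moved = 0;
      if (prev) {
        for (let i = 0; i < curr.length; i += 4) {
          // compare the red channel only; crude but fine for rough motion
          if (Math.abs(curr[i] - prev[i]) > 32) moved++;
        }
      }
      prev = curr;
      return moved; // caller decides how many changed pixels count as a "hit"
    }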
Sounds very cool, but I could not make sense of the on-screen instructions. Some images or animations would go a long way to explain the controls.
Sorry about that, the instructions need to be improved.
Does this video demo help?
https://x.com/measure_plan/status/1929900748235550912
If that makes it clearer, I'll upload it to the GitHub repo directly
That video did help. I think I was thrown off by two things: 1) I was expecting 3D controls with more direct mapping (e.g. rotating my hand rotates the model). This is more like gesture mouse controls. 2) Some of the controls were too subtle. The scaling between my gesture size and effect on screen was smaller than I expected.
Great area to develop though. There's so much untapped potential in applying Mediapipe.
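For example, hand roll could be estimated from two landmarks and mapped straight onto the model, with a gain factor so subtle gestures still read on screen. An illustrative sketch (landmark indices from MediaPipe Hands; the GAIN value is made up):

    // Illustrative: estimate hand roll from wrist (0) to middle-finger
    // base (9) and apply it directly to a three.js object.
    const GAIN = 2.0; // amplify subtle gestures so the effect reads on screen

    function applyHandRoll(landmarks, model) {
      const angle = Math.atan2(
        landmarks[9].y - landmarks[0].y,
        landmarks[9].x - landmarks[0].x
      );
      model.rotation.z = angle * GAIN; // "model" is a three.js Object3D
    }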
Thank you for the feedback. I'll continue to work on it!
Great job! Looks very useful for interactive content creation and product showcasing. I'll definitely be testing it more. Thanks for sharing.
Yes, I'd love to go further with this concept so that 3D/CAD designers could easily present their models during video calls.
Thank you!