• Adalast@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Your points on MV are not unfounded, but they are also extremely homeocentric. All of your examples rely on the visible light spectrum as well as standard “vision” as we know it. Realistically any sensor can be used to generate an image if you know what you are doing with it. Radio telescopes are a great example of this. There is also a lot of research going on in giving AI’s MV senses access to other sections of the EM spectrum ( https://www.edge-ai-vision.com/2017/10/beyond-visible-light-applications-in-computer-vision/ and https://www.technologyreview.com/2019/10/09/132696/machine-vision-has-learned-to-use-radio-waves-to-see-through-walls-and-in-darkness/ ) as well as echolocation ( https://www.imveurope.com/news/echolocation-neural-net-gives-phones-3d-vision-sound ). There are many other types of “vision” that can be used that can definitely distinguish a popcan.

    • agent_flounder
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Agree that other parts of the EM spectrum could enhance the ability of MV to recognize things. Appreciate the insights – maybe I will be able to use this when I get back to tinkering with MV as a hobbyist.

      Of course identifying one object is one level. For a general purpose replacement for humans ability, since that’s what the thread is focused (ahem) on, it has to identify tens of thousands of objects.

      I need to rethink my opinion a bit. Not only how far general object recognition is but also how one can “cheat” to enable robotic automation.

      Tasks that are more limited in scope and variability would be a lot less demanding. For a silly example, let’s say we want to automate replacing fuses in cars. We limit it to cars with fuse boxes in the engine bay and we can mark the fuse box with a visual tag the robot can detect. The layout of the fuses per vehicle model could be stored. The code on the fuse box identifies the model. The robot then used actuators to remove the cover and orients itself to the box using more markers and the rest is basically pick and place technology. That’s a smaller and easier problem to solve than “fix anything possibly wrong with a car”. A similar deal could be done for oil changes.

      For general purpose MV object detection, I would have to go check but my guess is that what is possible with state of the art MV is identifying a dozen or maybe even hundreds of objects so I suppose one could do quite a bit with that to automate some jobs. MV is not to my knowledge at a level of general purpose replacement for humans. Yet. Maybe it won’t take that much longer.

      In ~15 years in the hobbyist space we’ve gone from recognizing anything of a specified color under some lighting conditions to identifying several specific objects. And without a ton of processing power either. It’s pretty damn impressive progress, really. We have security cameras that can identify animals, people, and delivery boxes. I am probably selling short what MV will be able to do in 15 more years.