You have an extremely detailed world model including a mental model of the drivers and other road users around you. You rely on sight, sound, experience and lots of knowledge. You are aware of the social contracts at work when dealing with shared resources and your brain is many orders of magnitude more powerful than any box full of electronics.
What you can do with 'just vision' misses the fact that you are part of the hardware.
You rely on a moving camera, microphones and vibrations all together. Driven by a supposedly more advanced meatware than what tech can create today, so that it can properly reason even with faulty/missing signals.