
Bing Vision is an image recognition application created by Microsoft which is installed on Windows Phones running version 7.5 and above, including Windows Phone 8. It is a part of the Bing Mobile suite of services, and on most devices can be accessed using the search button. On Windows Phone 8.1 devices where Microsoft Cortana is available, it is only available through the lenses of the Camera app. Bing Vision can scan barcodes, QR codes, Microsoft Tags, books, CDs, and DVDs. Books, CDs, and DVDs are offered through Bing Shopping.

Computer Vision Annotation Tool (CVAT) is a free, open-source, web-based image and video annotation tool used for labeling data for computer vision algorithms. CVAT was developed for use by a professional data annotation team, with a user interface optimized for computer vision annotation tasks. It can be tried online at cvat.org.

Crowding is a perceptual phenomenon where the recognition of objects presented away from the fovea is impaired by the presence of other neighbouring objects. It has been suggested that crowding occurs due to mandatory integration of the crowded objects by a texture-processing neural mechanism, but there are several competing theories about the underlying mechanisms. It is considered a kind of grouping since it is "a form of integration over space as target features are spuriously combined with flanker features."

Face detection is a computer technology, used in a variety of applications, that identifies human faces in digital images. Face detection also refers to the psychological process by which humans locate and attend to faces in a visual scene.

Gesture recognition is a topic in computer science and language technology with the goal of interpreting human gestures via mathematical algorithms. Gestures can originate from any bodily motion or state but commonly originate from the face or hand. Current focuses in the field include emotion recognition from the face and hand gesture recognition. Users can use simple gestures to control or interact with devices without physically touching them. Many approaches use cameras and computer vision algorithms to interpret sign language. However, the identification and recognition of posture, gait, proxemics, and human behaviors is also the subject of gesture recognition techniques. Gesture recognition can be seen as a way for computers to begin to understand human body language, thus building a richer bridge between machines and humans than primitive text user interfaces or even GUIs, which still limit the majority of input to keyboard and mouse. It lets people interact naturally without any mechanical devices. Using gesture recognition, it is possible to point a finger so that an on-screen pointer moves accordingly. This could make conventional input devices redundant.
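
The pointing interaction described above can be illustrated with a minimal sketch. It assumes that some hand tracker (not shown, and hypothetical here) already reports a fingertip position as normalized coordinates between 0 and 1. The class name `PointerMapper`, its parameters, and the exponential smoothing step are illustrative assumptions, not part of any particular gesture recognition system.

```python
from typing import Optional, Tuple


class PointerMapper:
    """Hypothetical helper: maps a normalized fingertip position to screen pixels."""

    def __init__(self, screen_width: int, screen_height: int, smoothing: float = 0.5):
        # smoothing is the weight given to the newest sample (1.0 means no smoothing).
        if not 0.0 < smoothing <= 1.0:
            raise ValueError("smoothing must be in (0, 1]")
        self.screen_width = screen_width
        self.screen_height = screen_height
        self.smoothing = smoothing
        self._last: Optional[Tuple[float, float]] = None

    def update(self, finger_x: float, finger_y: float) -> Tuple[int, int]:
        """Return the pointer position in pixels for one fingertip sample."""
        # Clamp to the valid range so a fingertip outside the frame
        # pins the pointer to the screen edge instead of leaving the screen.
        fx = min(max(finger_x, 0.0), 1.0)
        fy = min(max(finger_y, 0.0), 1.0)

        # Scale normalized coordinates to pixel coordinates.
        target_x = fx * (self.screen_width - 1)
        target_y = fy * (self.screen_height - 1)

        if self._last is None:
            smoothed = (target_x, target_y)
        else:
            # Exponential smoothing reduces jitter from noisy tracking.
            a = self.smoothing
            smoothed = (
                a * target_x + (1.0 - a) * self._last[0],
                a * target_y + (1.0 - a) * self._last[1],
            )
        self._last = smoothed
        return round(smoothed[0]), round(smoothed[1])


if __name__ == "__main__":
    mapper = PointerMapper(1920, 1080, smoothing=0.5)
    for sample in [(0.0, 0.0), (0.5, 0.5), (1.0, 1.0)]:
        print(mapper.update(*sample))
```

A lower `smoothing` value gives a steadier but slower pointer, which is a trade-off a real system would have to tune.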

Microsoft PixelSense is an interactive surface computing platform that allows one or more people to use and touch real-world objects, and share digital content at the same time. The PixelSense platform consists of software and hardware products that combine vision-based multitouch PC hardware, 360-degree multiuser application design, and Windows software to create a natural user interface (NUI).

Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class in digital images and videos. Well-researched domains of object detection include face detection and pedestrian detection. Object detection has applications in many areas of computer vision, including image retrieval and video surveillance.
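
As an illustration, detectors commonly report each detected instance as a bounding box with a confidence score, and overlapping boxes for the same object are often merged in a post-processing step. The following is a minimal sketch of two widely used building blocks, intersection over union (IoU) and greedy non-maximum suppression. The box format `(x_min, y_min, x_max, y_max)`, the function names, and the default threshold of 0.5 are assumptions chosen for this example, not the interface of any specific library.

```python
from typing import List, Tuple

# Assumed box format: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]
Detection = Tuple[Box, float]  # (box, confidence score)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def non_max_suppression(
    detections: List[Detection], iou_threshold: float = 0.5
) -> List[Detection]:
    """Greedy NMS: keep the highest-scoring box, drop boxes overlapping it too much."""
    kept: List[Detection] = []
    # Visit detections from most to least confident.
    for box, score in sorted(detections, key=lambda d: d[1], reverse=True):
        if all(iou(box, kept_box) <= iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


if __name__ == "__main__":
    candidates = [
        ((0, 0, 10, 10), 0.9),    # strong detection
        ((1, 1, 11, 11), 0.8),    # near-duplicate of the first box
        ((20, 20, 30, 30), 0.7),  # separate object
    ]
    for box, score in non_max_suppression(candidates):
        print(box, score)
```

In this example the second candidate overlaps the first with an IoU of about 0.68, so it is suppressed, while the third box is kept because it does not overlap either of the others.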

Pedestrian detection is an essential task in any intelligent video surveillance system, as it provides the fundamental information for semantic understanding of the video footage. It extends naturally to automotive applications because of its potential for improving safety systems. As of 2017, many car manufacturers offered it as an advanced driver-assistance system (ADAS) option.

Umoove is a high-tech startup company that has developed and patented a software-only face and eye tracking technology. The idea was first conceived as an attempt to aid people with disabilities but has since evolved. The only requirement for tablet computers and smartphones to run Umoove software is a front-facing camera. Umoove's headquarters are in Israel, in Jerusalem’s Har Hotzvim.