Hey #OpenCV #ComputerVision #Python
I would like to point a camera at an area of the house and have it announce when a dog has entered the camera frame.
I am quite handy with Python and can muddle my way through C-like stuff if I have good documentation or example code.
Is this easy or hard? Hard is not a dealbreaker, just trying to tune my expectations a bit.
edit: strictly speaking, it only needs to spot two dogs that I have many photos of, but a generic dog detector would also be fine.
Meet #PaliGemma 2 - Google DeepMind’s latest leap in vision-language models (VLM)!
Available in 3 different sizes & input image resolutions, PaliGemma 2 achieves state-of-the-art performance on several vision-language benchmarks.
Details on #InfoQ https://bit.ly/4gOMEBX
Beyond Fairness in Computer Vision: A Holistic Approach to Mitigating Harms and Fostering Community-Rooted Computer Vision Research
Timnit Gebru and Remi Denton
"ABSTRACT: The field of computer vision is now a multi-billion dollar enterprise, with its use in surveillance applications driving this large market share. In the last six years, computer vision researchers have started to discuss the risks and harms of some of these systems, mostly using the lens of fairness introduced in the machine learning literature to perform this analysis. While this lens is useful to uncover and mitigate a narrow segment of the harms that can be enacted through computer vision systems, it is only one of the toolkits that researchers have available to uncover and mitigate the harms of the systems they build.
In this monograph, we discuss a wide range of risks and harms that can be enacted through the development and deployment of computer vision systems. We also discuss some existing technical approaches to mitigating these harms, as well as the shortcomings of these mitigation strategies.
Then, we introduce computer vision researchers to harm mitigation strategies proposed by journalists, human rights activists, individuals harmed by computer vision systems, and researchers in disciplines ranging from sociology to physics. We conclude the monograph by listing principles that researchers can follow to build what we call community rooted computer vision tools in the public interest, and give examples of such research directions. We hope that this monograph can serve as a starting point for researchers exploring the harms of current computer vision systems and attempting to steer the field into community-rooted work."
https://cdn.sanity.io/files/wc2kmxvk/revamp/79776912203edccc44f84d26abed846b9b23cb06.pdf
#AI #MachineLearning #BiasInAI #STEMSaturday #DeepLearning #ComputerVision #Robotics #ReinforcementLearning
Meet the editors of "Mitigating Bias in Machine Learning": Dr. Carlotta Berry and Dr. Brandeis Hill Marshall.
This practical guide shows, step by step, how to use machine learning to carry out actionable decisions that do not discriminate based on numerous human factors, including ethnicity and gender.
On sale on Amazon: https://a.co/d/dtMizVH
Attention: the #python PyPI package of the popular object detection model #YOLO, as implemented by #Ultralytics, has been compromised.
There is an ongoing investigation into the matter:
https://github.com/ultralytics/ultralytics/issues/18027
For now, it would be best to uninstall the package.
https://nianticlabs.com/news/largegeospatialmodel/
“Niantic is in a unique position to lead the way in making a Large Geospatial Model a reality, supported by more than a million user-contributed scans of real-world places we receive per week.” #Maps #ComputerVision
I’m curious whether this will go any further than LLMs have gone - or fall short. How valuable is a model like this without fully labeled, real-world objects & features?
Introducing IMGS.AI, a multimodal search engine revolutionizing Digital Art History! Built with cutting-edge ML models like CLIP, it addresses image retrieval challenges and proposes solutions for standardizing feature extraction.
By @zentralwerkstatt & @peterbell
#DigitalArtHistory #MachineLearning #ComputerVision #CLIPModel #HCI #BigImageData #FeatureExtraction
https://dahj.org/article/imgsai
Exciting news in wearable assistive tech! Be My Eyes is now available on Meta's Ray-Ban smart glasses in the UK.
Based on my experience with WhatsApp video calls, it might take some practice to get the hang of it—hopefully, the volunteers won't end up with motion sickness!
The future of accessibility is looking bright.
#Accessibility #AI #BeMyEyes #Blind #computerVision #Disability #Meta