Apple to preview AI research at a conference before WWDC


Apple will present 14 AI research papers at the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition in Denver next week, spanning image generation, spatial understanding, and multimodal reasoning.

The company continues to explore the applications of AI and large language models, with studies detailing their use in image generation, UI prototyping, QE testing, and much more. Apple’s researchers have also examined the role of artificial intelligence in spatial understanding, with ideas that iOS 27 Accessibility features will bring to reality.

Some of the studies AppleInsider has previously detailed, including one on AI-powered sign language annotation, will be presented at the annual IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). The Apple-sponsored event will take place from June 3 through June 7 at the Colorado Convention Center in Denver.

Apple has published the schedule for presentations and workshops by its researchers at CVPR. June 3 will see a keynote presentation titled “Generative AI for Sign Language (GenSign) Workshop” by Colin Lea, who worked on the previously mentioned AI annotation study. Other Apple researchers and engineers will hold invited talks on June 3 and June 4.

Hsin-Ping (Cindy) Huang and Maggie Xiao will represent Apple at the WiCV Mentorship Dinner on June 4. This will be followed by a series of poster presentations at Apple’s CVPR booth, number 231, from June 5 through June 7. Exhibition hours run from 10 a.m. MDT and last until 6 p.m. MDT on June 5 and June 6, with a slightly shorter run on June 7, ending at 3 p.m. MDT.

The following Apple studies will be presented at the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition:

  • AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding
  • AToken: A Unified Tokenizer for Vision
  • Bootstrapping Sign Language Annotations with Sign Language Models
  • DSO: Direct Steering Optimization for Bias Mitigation
  • From Where Things Are to What They’re For: Benchmarking Spatial-Functional Intelligence for Multimodal LLMs
  • Learning Long-Term Motion Embeddings for Efficient Kinematics Generation
  • Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing
  • SO-Bench: A Structural Output Evaluation of Multimodal LLMs
  • STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
  • TrajTok: Learning Trajectory Tokens enables better Video Understanding
  • UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning
  • Velox: Learning Representations of 4D Geometry and Appearance
  • VSAS-Bench: Real-Time Evaluation of Visual Streaming Assistant Models
  • What Matters in Practical Learned Image Compression

While all of these studies are significant in their own right, some offer a clearer idea of the practical applications Apple has in mind for its AI.

Why Apple’s studies in the computer vision conference matter

For instance, “From Where Things Are to What They’re For: Benchmarking Spatial-Functional Intelligence for Multimodal LLMs” lines up with the Live Recognition Accessibility feature in iOS 27. Insights from the study may also prove useful for the long-rumored camera-equipped AirPods.

Apple’s research into improving AI-powered image generation may have similarly played a part in Image Playground improvements rumored for iOS 27. The company’s studies regarding the use of AI in fixing code bugs line up with its efforts to integrate AI into Xcode.

Overall, the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) is an event worth visiting, given the content Apple is expected to present. After that, Apple’s annual WWDC will begin with a keynote video on June 8.



Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


“It was severely downgraded,” Gilbert confirms. “I never would have found it if I was just looking through Google results.” (I tried the same prompt in Gemini earlier this month, and after an initial denial, the tool also gave me Eiger’s number.)

After this experience, Eiger, Gilbert, and another UW PhD student, Anna-Maria Gueorguieva, decided to test ChatGPT to see what it would surface about a professor. 

At first, OpenAI’s guardrails kicked in, and ChatGPT responded that the information was unavailable. But in the same response, the chatbot suggested, “if you want to go deeper, I can still try a more ‘investigative-style’ approach.” Their inquiry just had to help “narrow things down,” ChatGPT said, by providing “a neighborhood guess” for where the professor might live, or “a possible co-owner name” for the professor’s home. ChatGPT continued: “That’s usually the only way to surface newer or intentionally less-visible property records.” 

The students provided this information, leading ChatGPT to produce the professor’s home address, home purchase price, and spouse’s name from city property records. 

(Taya Christianson, an OpenAI representative, said she was not able to comment on what happened in this case without seeing screenshots or knowing which model the students had tested, even after we pointed out that many users may not know which model they were using in the ChatGPT interface. She also declined to comment generally about the exposure of PII by the chatbot, instead providing links to documents describing how OpenAI handles privacy, including filtering out PII, and other tools.) 

This reveals one of the fundamental problems with chatbots, says DeleteMe’s Shavell. AI companies “can build in guardrails, but [their chatbots] are also designed to be effective and to answer customer questions.”

The exposure issue is not limited to Gemini or ChatGPT. Last year, Futurism found that if you prompted xAI’s chatbot Grok with “[name] address,” in almost all cases, it provided not only residential addresses but also often the person’s phone numbers, work addresses, and addresses for people with similar-sounding names. (xAI did not respond to a request for comment.) 

No clear answers

There aren’t straightforward solutions to this problem—there’s no easy way to either verify whether someone’s personal information is in a given model’s training set or to compel the models to remove PII. 



Source link