Small AI models just got a surprising boost from a very old game.
MIT researchers used a Battleship-style setup to test whether AI agents can improve how they gather information before making a move. The result was a sharp jump in performance for smaller systems, including one model that went from rarely beating humans to winning most of its games after researchers changed how it searched the board.
That shift goes straight at one of the biggest weaknesses in today’s AI agents. They’re often asked to handle tasks where the answer depends on details they don’t have yet. MIT’s work suggests better question planning can make a cheaper model act far more capable.
How much smarter did it get
MIT’s test used a version of Battleship built around natural-language questions. One AI agent played the role of the teammate trying to locate hidden ships, while another had access to the board and answered.
Digital Trends
The biggest jump came from Llama 4 Scout. MIT said the smaller model beat human players in only 8% of games at first. After researchers added a more deliberate inference strategy, it beat humans 82% of the time and outpaced a larger frontier model while operating at about 1% of the cost.
That’s the number to watch if you care about AI costs. The model didn’t win by getting larger, but won by choosing sharper questions and making better use of each answer.
Why does Battleship help AI learn
Battleship works as a test because it forces an AI agent to act with limited information. It can’t see the whole board, so every question has to narrow the search and set up the next move.
That maps neatly onto practical AI tools. A support bot, research assistant, or planning agent often needs to ask follow-ups before it can help. When that process breaks down, the model can miss a key detail, repeat itself, or make a recommendation too early.
The MIT approach puts pressure on that weak spot. It measures whether an agent can gather the right information before producing an answer.
Where could this go next
The harder test is whether the same approach works beyond games. Battleship is controlled, which makes it easier to score than open-ended agent workflows in search, customer support, or workplace software.
Still, the direction is worth watching. If smaller models learn to ask sharper questions before acting, companies could build cheaper AI tools that feel more capable in everyday use.
The next milestone is transfer from the game board to real work. A task with unclear instructions, missing files, and a rushed user will be much harder to solve.
The perfect robot mower for you is not nearly as fancy and feature-heavy as you may think. I’ve said it before, and I’ll say it again: it’s not the lawn mower, it’s all about the yard. A robot mower may be a market leader with top-of-the-line specs and still not be a good fit for your yard.
Here’s the great news: There’s a perfect robot mower for almost any yard. As someone who’s tested numerous types of robot lawn mowers, I’ve learned that many of the specs that brands market as groundbreaking are simply not vital for most shoppers. A mostly flat, fenced-in 0.10-acre yard doesn’t need the power that a hilly, sectioned, unfenced one-acre yard does.
A LiDAR, GPS, or wired boundary robot mower works for these yards. If you choose a wired boundary, you may have to bury wire around the flower beds, unless the borders are tall enough for the mower to avoid.
1. Don’t focus on: ‘AI-powered’ or other marketing buzzwords
Maria Diaz/ZDNET
Artificial intelligence (AI) has surpassed the popularity of acid-wash jeans in the 80s and Baby G watches in the early 2000s. And tech companies — including robot lawn mower manufacturers — are capitalizing on its appeal.
Most of these “AI-powered” or “intelligent mowing” terms are vague, geared to grab shoppers’ attention with buzzwords. That doesn’t mean that the robots don’t use AI to navigate, however.
The key is to find out how the robot uses AI to its benefit, and whether that will meet your AI expectations.
AI algorithms typically process data captured by the robot’s hardware to help it make quick decisions and adjustments. For example, a robot lawn mower may have a set of sensors and cameras to capture its surroundings. The robot’s processor then uses AI to convert that information into actionable data, so it knows whether to swerve to avoid an obstacle or slow down around a retaining wall.
Instead, look for: The navigation tech under (and on) the hood
Instead of AI and other buzzwords, you should focus on matching the robot lawn mower’s hardware and navigation system to your yard. This includes whether the robot uses RTK (Real-Time Kinematic) for positioning, and whether it features LiDAR, cameras, and sensors.
Then look at real user reviews to assess how accurately the robot mower maps and how well it performs around various types of obstacles.
There’s no blanket rule for robot mowers, but most do well with the following guidelines.
2. Don’t focus on: Premium extras
Maria Diaz/ZDNET
Skip the premium extras that don’t match your yard. You really don’t need the most advanced robot mower; you need the one that will best handle your lawn.
Most US homeowners have mostly flat lawns, simple rectangular layouts, minimal obstacles, and small yards. Yet some of the most popular mowers advertise features that don’t match this, and you don’t want to spend an extra few hundred dollars on advanced features that won’t deliver a noticeable difference in your yard.
Instead, look for: Only as much as you need
Do you have a mostly flat lawn with no fences and need a robot that can navigate to several sections separated by paths? Then you can skip AWD models and commit to superior mapping and navigation features, like multi-zone intelligence.
Similarly, if you have a yard with dense trees covering most of it, it’s safe to skip the RTK models and go for LiDAR or boundary wire options instead.
3. Don’t focus on: Flashy app features
The path lines created by the Mammotion Luba 2, as captured by our Bink Outdoor camera, is one flashy app feature I can’t quit.
Maria Diaz/ZDNET
Any dependable robot lawn mower requires an equally reliable mobile app to let you use it effectively. However, manufacturers market many flashy app features that end up being unnecessary for many users.
Don’t make app features the deciding factor unless it’s something you genuinely care about. Many users don’t rely on voice control to run their mowers and don’t mind using a separate app for their robot rather than integrating it into an existing home automation system.
A robot lawn mower with mediocre navigation and cutting performance can still have a flashy app — all while leaving behind missed patches or taking longer to finish mowing.
Instead, look for: The features you’ll actually use
Most robot mower users keep them running on a schedule to get the lawn-cutting chore off their minds. The majority of the most popular models offer basic features beyond scheduling, such as remote start and stop, basic mapping, automatic rain delay, and theft protection.
It’s easy to find robot lawn mowers with these features, but if you’re looking for anything beyond that, just be sure that the feature is worth it, especially if you’re paying extra for that model.
An example of a flashy app feature that is completely unnecessary, but I love having? The Mammotion’s pattern cutting. I can select the cutting pattern I want on the Mammotion app, whether I want lines or checkered, but I can also have the robot cut in custom patterns, like letters and numbers. I don’t care for mowed letters in my yard, but I like that it always has that freshly mowed checkered patterned with no effort from me.
4. Don’t focus on: Cutting system extras
Maria Diaz/ZDNET
The cutting width and system specs are important, as they can determine whether a robot can cover a given area in a day. However, most robot mowers use similar multiple-blade mulching systems.
Unlike traditional lawn mowers with large blades for aggressive cutting in a single pass, robot mowers typically feature a set of small blades that constantly spin. Because of this, robot mowers trim smaller amounts of grass with each pass than a traditional mower, but they also cut more frequently and leave behind smaller grass clippings that decompose naturally.
Because the robot mowers have a smaller, compounding cutting system, the real-world differences between the cutting systems from one brand to another are often smaller than you’d expect. Other issues, like poor navigation, will be glaringly obvious before small differences in blade design.
Instead, look for: Cutting width and yard size
The average US yard would benefit more from navigation quality, consistency, and connectivity than blade design. Instead, you should focus on matching the mower to your yard size.
The robot’s capacity is measured in how many acres it can cover in a day. Among other features, this is calculated based on your robot’s battery size and cutting width. Essentially, most users want a robot that can mow an entire yard in a day, so you can set it and forget it and always come home to a mowed yard. You get this by getting the appropriate robot for your yard size.
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional
Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes.The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.