I tested Claude and ChatGPT on a simple task—both failed spectacularly


AI chatbots are incredibly useful for my work. I use them for research, for proofreading and fact-checking, and for keeping track of articles and pitches. There’s one job I had really hoped AI could take off my hands. Sadly, both Claude and ChatGPT failed dismally.

I have no problem with AI handling the mundane tasks

AI should help make us more efficient

1X NEO humanoid robot still from promotional video. Credit: 1X

There’s an excellent quote from author Joanna Maciejewska that completely sums up my feelings about AI. “I want AI to do my laundry and dishes so that I can do art and writing, not for AI to do my art and writing so that I can do my laundry and dishes.”

The same is true of my work. I don’t want AI to write articles for me, and I don’t want to read articles written by AI. They don’t have real-world experience with what they’re writing about.

There are plenty of parts of my work that AI does help with, and it’s the mundane parts, such as fact-checking, that I’m more than happy for AI to make easier. The less time I have to spend on routine tasks, the more time I can spend writing.

There’s one particularly mundane task that I still do manually, and it takes up a lot of time. I was hoping that AI might be able to take it off my hands.

Annotating screenshots is a thankless task

Adding arrows to multiple images eats up time

The Add label button for an area in Home Assistant.

One of the most tedious parts of writing how-to guides is creating step-by-step screenshots. These are really useful for helping readers to follow the guides quickly and accurately, but they’re a pain to create.

First, I have to work through all of the steps and take screenshots of all the relevant parts. This isn’t too painful, as it only takes a second or two to take a screenshot.

The time-consuming parts come when I have to annotate the screenshots. When a screenshot is trying to illustrate a specific menu or button, it’s not always immediately obvious which menu or button the screenshot is focused on. To solve this, arrows are added to the screenshots to indicate the appropriate objects.

Adding these annotations isn’t hard; it’s just time-consuming and tedious, especially when I have a lot of screenshots to annotate. My hope was that I’d be able to point an AI chatbot at a folder of screenshots, upload the text of the step-by-step instructions, provide some examples of previously annotated screenshots, and have the AI add the arrows to the images for me. It felt like this was something that should be possible with the current state of AI tools.


OpenAI ChatGPT app on a laptop


OpenAI is close to turning ChatGPT into a super app—here’s what it will do

You could create and code in one place.

ChatGPT wasn’t up to the job

The results were poor

ChatGPT currently has one of the best image generation models out there. The GPT Image 2 model can produce impressive results that can be very hard to distinguish from real images. I figured that if any chatbot could nail adding arrows to screenshots, it would be ChatGPT.

I uploaded the screenshots and step-by-step instructions and gave ChatGPT an example image so it knew what the annotations should look like. ChatGPT got to work and annotated all the screenshots for me.

The results were terrible. The arrows were pointing to completely the wrong things or to nothing at all. Some of the arrows were mangled, and not a single screenshot was even close to being usable. I tried again with refined prompts that were more explicit to try to solve these issues, but every time I tried, I got the same poor results.

Even with computer use, Claude failed

Full control of my computer wasn’t enough

Having failed with ChatGPT, I decided to give Claude a go. While Claude didn’t have native image generation tools, I’d used a Claude feature called computer use before, which lets Claude interact with a computer using screenshots, mouse control, and keyboard input. I figured that using this tool, Claude could take control of apps such as Preview, annotate the screenshots, see the results of its efforts, and adjust the positions of the arrows until they were exactly right.

I set everything up, pointed Claude at the folder of screenshots, uploaded the step-by-step instructions it needed to follow, provided some example screenshots, and gave it all the necessary permissions. I set Claude to work and sat back, waiting for my perfectly annotated screenshots to be generated.

Once again, I was hugely disappointed. The arrows often partially obscured the objects that they were meant to point to, pointed past them, or had ridiculously long tails. The icing on the cake was that Claude saved its annotated files over the top of the ones I’d generated with ChatGPT, completely destroying those files.

I tried refining my prompts and giving clearer instructions. Using Opus instead of Sonnet, I was able to get some of the results to be half-decent. The problem was that it wasn’t reliable; I could never manage to get a full set of images that didn’t need some kind of fixing, which completely defeats the purpose.


AI still can’t do everything perfectly

AI is really useful, but there are still things it can’t do without significant human review or correction. The problem is that when AI makes mistakes, fixing those mistakes can often take longer than just doing the job myself. For now, I’ll have to stick to annotating screenshots myself. Maybe Fable 5 can pull it off if we ever see it in action again.



Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


Pixar is the champion of animation, but not all of their movies have had the chance to shine. For 40 years, the studio has brought families together across 30 movies. Certain movies never enter the discussion of being among the studios’ best — they were overshadowed by other films, or they went direct-to-streaming on Disney+.

In honor of the 40th anniversary, here are four Pixar movies that are worth reevaluating in 2026.

Toy Story 4

A surprisingly strong sequel

In 2010, Toy Story 3 brought Pixar’s debut franchise to an emotional close, as Woody (Tom Hanks), Buzz (Tim Allen), and the gang said farewell to Andy, preparing for a new life with Bonnie (Madeleine McGraw). After bringing their genre-defining animated trilogy to a fitting conclusion, I was doubtful that any follow-up could ever live up to the trilogy’s legacy. However, I was pleasantly surprised when I finally found the time to watch Toy Story 4.

As the gang of toys and Bonnie embark on a trip, Woody sets out to help the handcrafted toy Forky (Tony Hale) while also reuniting with Bo Peep (Annie Potts), who has become a rescuer of stray toys. As expected, Pixar’s animation remains ever-impressive, but Toy Story 4 manages to recapture the charm of the original 3 movies and offer a surprisingly fitting epilogue to Woody’s story in particular. Even with a new installment on the horizon, the emotion behind Toy Story 4‘s major status quo change for the gang ensures that the movie will be able to stand on its own merits for many years to come.

Turning Red

A stylistic reinvention

2022’s Turning Red saw Pixar take another crack at a coming-of-age story. The young Mei (Rosalie Chiang) clashes with her mother, Ming Lee (Sandra Oh), leading to her learning that she inherited the power to turn into a gigantic red panda in moments of heightened emotion. With her favorite boy band in town, Mei and her friends plan to use these gifts to attend the concert. As the concert draws nearer, however, Mei continues to clash with her mother, building to a generational showdown to heal her family’s curse.

Amazon Fire TV Stick 4K Max

Integrations

Alexa

Storage

16GB


When compared to what came before, Turning Red is a drastic stylistic departure from Pixar’s filmography. Mei’s story is told in a more informal manner when compared to other features, as Mei breaks the fourth wall and is incredibly expressive when compared to how past features tiptoed the line between cartoon and realism. However, this stylistic decision gives Turning Red a unique charm while making its story feel all the more personal and emotional, as we are given a clearer insight into Mei’s state than any other Pixar protagonist that has come before.​​​​​​​

Monsters University

Expanding a universe

While Toy Story had proven that Pixar could create successful sequels, expanding on a movie was still a rare move for the studio in the early 2010s, with said franchise and Cars being an exception. As such, Monsters University had a lot of pressure placed upon its shoulders when it released. Set several years before the events of Monsters Inc, the prequel explores how Mike (Billy Crystal) and Sully (John Goodman) went from fierce rivals to the firmest of friends during their time at the titular scaring school.

Blending the setting and cast of Monsters Inc. with a teen college movie was an ideal choice to expand the world of this Pixar movie, as most of the charm found in Monstropolis comes from how it drastically imagined elements of our own world in its monstrous lens. Furthermore, it is interesting to see that Sully and Mike began as rivals, and Mike’s arc focusing on his struggle to be a scarer does add layers to where his journey ends in the original movie. As such, Monsters University is a worthy prologue to one of Pixar’s most enduring franchises.​​​​​​​

Soul

A deeper tale with age

Pixar is unafraid to tackle deeper and more mature subjects. However, I feel Soul stands as one of their most ambitious explorations yet. On the verge of fulfilling his dream, Joe (Jamie Foxx) is caught in a near-death experience, leading to him becoming a disembodied soul in the “Great Before.” When his soul is tasked to guide the reluctant 22 (Tina Fey) into finding the passion that will drive her during her time on Earth, Joe is taken on a journey to not only return to his body but also reconsider what drives him and what is important in life.

For a studio that has prided itself on packaging deeper themes into a family-friendly package, Soul easily stands as a movie that feels targeted for its older viewers. Children may be inspired to take joy in everything life can offer through 22’s journey, but Joe’s story is particularly relatable to those who have had to grapple with their passions being lost or an unpredictable turn in life putting a stop to a dream, and watching him regain that through his experiences with 22 is incredibly emotional. While it may not have had a chance to shine at the box office, Soul will stand as a fondly remembered Pixar classic. Hopefully, new viewers and young fans can begin to see the movie through different perspectives as they face their own trials.​​​​​​​


Subscription with ads

Yes, the Disney Basic plan

Simultaneous streams

Up to 4

Live TV

No

Price

Starting at $10/month

Family favorites, old classics, and exciting new TV all in one place.




Source link