I switched from LM Studio to llama.cpp, and I’m never going back to a bloated wrapper


Running AI locally sounds like it should be straightforward until you realize that the app making it feel easy is quietly eating the resources you actually need. I spent time with LM Studio before I started noticing that my hardware was working harder to keep the interface alive than to run the model itself. However, Llamma.cpp is much better and can even run on Raspberry Pi.

LM Studio has too much bloat

I ditched the heavy wrappers for raw llama.cpp

Llama next to a task manager Credit: Jorge Aguilar / HowToGeek

When I started running AI locally, I gravitated toward tools like LM Studio. It is pretty easy to see why, since it is very popular thanks to its model search, downloading, and chat interface. It doesn’t feel much different than using any other app on your computer, and you don’t even need a NAS.

All that convenience comes at a price, though, because the packaging just hides what is actually doing the work. LM Studio, Ollama, and GPT4All are all local AI running the same core engine underneath, which is llama.cpp.

What is different is everything that is built around that engine. Heavy GUI managers force your OS to burn memory and CPU cycles just to keep the interface alive. My hardware was spending its budget rendering visual elements and maintaining API translation layers instead of doing the actual AI work. I didn’t spend long on LM Studio because it was clearly going overboard.

The main culprit is that most of these managers are built on Electron, which ships a full Chromium browser engine bundled with a Node.js runtime. That’s expensive even when the AI isn’t doing anything.

In practice, LM Studio alone can sit at 1.40 GB of RAM and pull up to 1.2 GB of GPU VRAM just as background overhead. On an 8 GB card, that’s not a minor inconvenience; it directly determines which models you can even load. Every megabyte the wrapper takes is a megabyte the model doesn’t get.

Running llama.cpp as a native binary cuts all of that out. While other AI may force your PC to waste memory just from the empty UI, llama.cpp keeps its background footprint down low. When it is running, it doesn’t have to be more than a regular browser. Wrappers also add latency. You get prompt ingestion, which is just the wait time before you see the first token. There was a noticeable difference between running llama.cpp and using LM Studio.

Bypassing the wrapper fixed that. There’s another upside, too, because llama.cpp moves fast, and GUI tools always lag behind its release cycle by weeks. Running it directly means new features like multi-modal audio inputs are available the moment they ship.

You get real control for a smaller learning curve

The learning curve of a command-line interface can feel intimidating coming from a GUI. I remember that I had thought that any time I was using a command line, I was likely going to break something on the PC. However, if you switch to raw llama.cpp it’s worth learning.

To get llama.cpp running on your PC, you need files from two places, pull them both into the same local folder, and you’re basically done.

Start at the llama.cpp GitHub repository. Go to the latest release and download the pre-compiled zip that matches your hardware. Create a folder somewhere convenient and unzip everything into it.

Then head to Hugging Face, grab whichever model you want in GGUF format, but a lighter one is smarter for testing, and drop that file into the same folder.

To run it, type cd then the path from the folder. Then name the AI in a script with the first prompt, and you can start talking.

Make sure to use the launch string with the model filename before your first prompt. Here is what I used llama-cli -m meta-llama-3-8b-instruct.Q4_K_M.gguf -ngl 99 -p "Why is running AI via raw llama.cpp better than a heavy GUI wrapper?"

The performance difference is hard to ignore once you see it. Idle VRAM usage drops from several gigabytes to a fraction of one. Prompt processing speeds jump significantly enough that I noticed it on the first request. Stripping out the GUI and tuning things yourself sounds complicated, but you will definitely see the difference.

The trade-off is worth it

The performance gains make it hard to go background

AI for llama on server Credit: Jorge Aguilar / HowToGeek

It’s easy to see why someone would argue that a GUI is better for beginners. Apps like LM Studio offer a comfortable, pick-up-and-play experience that hides the messy side of deployment. If you’re really that into a GUI, I’d recommend GPT4All over LM Studio because it’s not as restrictive or hard on your PC.

You can make this look like a regular chatbot if you run the code with your model and then -ngl 99 and the URL is http://localhost:8080. It just won’t run as well.

To most people, running a language model through a terminal looks like developer territory. Learning to go through directories and set execution parameters takes time, and that can put people off. Convenience would be why you’d head to heavy wrappers. However, treating local AI like a casual desktop app means paying a real performance price for all that graphical overhead.

I’m not willing to give up over a GB of VRAM just to keep an interface running. It is a huge waste. Learning the llama.cpp interface removes all of that, and you only have to learn it once. After that, your machine can focus on the actual work.

Now that I am used to the speed and control, going back to a heavy interface feels like a genuine step backward. It feels like giving up performance just for a pretty interface. Since llama.cpp includes a built-in web server, it’s not like you’re stuck staring at a terminal either. A little work learning a few commands gets you a much faster, cleaner setup.


The terminal is the difference maker

Switching to raw llama.cpp isn’t for everyone. If you’re not comfortable working from a terminal yet, the learning curve is real, even if it’s shorter than it looks. GPT4All is a more reasonable starting point than LM Studio if you want a GUI that doesn’t punish your hardware for existing. That said, once you’ve run a model without the wrapper overhead even once, it’s hard to unsee the difference. For a lot of setups, it’s the difference between loading the model you actually want and settling for something smaller.



Source link

Leave a Reply

Subscribe to Our Newsletter

Get our latest articles delivered straight to your inbox. No spam, we promise.

Recent Reviews


The iPhone Shortcuts app reminds me of Minecraft. It might be relatively easy to jump into, but it offers nearly limitless potential, allowing you to build anything you want. The same holds true for the Shortcuts app, and that endless possibilities are what many iPhone users might find intimidating. But you don’t have to.

If you are new to iPhone shortcuts, think of them as little automated helpers. You can build them yourself or find ones that others have built and use them. And that’s the beauty of shortcuts. If you don’t want to get your hands dirty, you can find shortcuts others have created and tailor them to your needs. 

With that said, let’s check out my favorite shortcuts. These are not the best shortcuts on everyone’s list, but they are the ones I use daily to get things done faster and more efficiently.

App settings: stop digging through the settings app

Anyone who has spent more than five minutes hunting for an app’s permissions inside the Settings app knows how frustrating it can be. You have to open the Settings app, scroll all the way down, open the Apps section, scroll again to find your app, and only then can you enter its settings. 

This shortcut fixes that completely. It uses the Get Current App and Open URLs actions in the Shortcuts app to detect which app you are currently in and jump straight to its settings page. Once you set it up and add it to your Control Center, all you have to do is open the app, swipe down from the top, and tap the shortcut. 

It will automatically open the current app’s settings. It is genuinely one of the most practical shortcuts I have ever created, and you can download it using the link below. 

Get App settings shortcut

Apple Frames 4: make your screenshots look professional

If you ever share screenshots on social media, a blog post, or a presentation, this shortcut is for you. Apple Frames 4 is a free shortcut by Federico Viticci of MacStories, which can wrap your screenshots in a proper device frame.

The latest version is noticeably faster, supports all recent Apple devices, and even lets you choose frame colors and scale the images proportionally. What I love most about this shortcut is that it can take multiple screenshots as input and combine them in one image. 

All the images in this article have been created using the same shortcut. If you also take screenshots regularly, I can highly recommend this shortcut. I would also recommend you check out my favorite screenshot utility for Mac. It offers all the missing features of Mac’s built-in screenshot tool and then some. 

Get Apple Frames shortcut

Scan document: your pocket scanner is already in your hand

You don’t need a third-party app to scan documents on an iPhone. You don’t even need to open the Notes or Files app the usual way. With this shortcut, you can open the document scanner instantly and scan and save papers without any extra steps.

I have it in my Home Screen and use it whenever I need to quickly scan a receipt, a letter, or any paper document. It’s one of those shortcuts that sounds simple until you realize how much time it saves you every week.

Get Scan Documents shortcut

Resize & convert: resize images without downloading a third-party app

How many times have you shared a photo only to find out it was too large, or in the wrong format for where you needed it? Since the iPhone Photos app doesn’t let you resize an image or change its format, I found a simple shortcut to do it. 

The steps are pretty easy, too. You pick the image, set the size, and the shortcut handles the rest. I use this a lot when I need to send images for articles or posts that require specific dimensions. 

It handles a task I would otherwise have to do on my Mac or download a third-party app on my iPhone to complete. 

Get Resize & convert shortcut

Extract PDF pages: pull out only what you need

I deal with a lot of PDFs, and sometimes I need to extract a few pages to share or save. So I downloaded a shortcut that lets you select specific pages from a PDF and extract them into a new file.

It sounds like a small thing, but if you have ever had to send someone just two pages from a 40-page PDF, you know how handy this is. You don’t need to download any app, pay a subscription, or open your Mac. Your iPhone handles it in seconds.

Get Extract PDF shortcut

Clipboard history: because you always lose what you copied

This is one of the most underrated shortcuts on this list. While macOS has finally added a clipboard history feature with the macOS Tahoe update, the iPhone still doesn’t have a clipboard history. That means every time I copy something on my iPhone, it erases all the previously copied items. 

So I built a shortcut to work around it. Now, every time I copy something on my iPhone, it saves to a note, creating a running clipboard history I can refer back to whenever I need it. The only issue is that I have to run the shortcut manually for it to work. 

So that’s why I have added it to the Back Tap gesture (go to Settings → Accessibility → Touch → Back Tap) on my iPhone. Once I copy something I want to save, I simply tap the back of my iPhone three times to trigger the shortcut and save the copied item in a preassigned note. 

When you download the shortcut, make sure to edit it by tapping the three-dot menu and selecting the note you want to use as your clipboard history.

Get Clipboard History shortcut

Turn off mobile data when iPhone connects to Wi-Fi

To balance the manual activation of the last shortcut, I give you one that is pure automation. Once you set it up, you never have to think about it again. The shortcut uses the Shortcuts automation feature to detect when your iPhone connects to a Wi-Fi network and automatically turns off your mobile data.

I have also set up the companion automation that turns mobile data back on when you leave Wi-Fi. It saves battery life and prevents your phone from uselessly using mobile data when it doesn’t need to. Since this is an automation, there’s no way to share a downloadable link, but you can learn how to create this shortcut. The screenshot should give you the basics of how to do it.

My 7 favorite iPhone shortcuts

I know the Shortcuts app can feel intimidating at first, but most of these require very little setup, and the payoff is immediately obvious. Start with one that solves a problem you have right now, and before long, you will be building your own.

If you have an iPhone and are not using Shortcuts, you are missing out on one of the most powerful tools Apple has built. So, definitely give this a try, and your life will never be the same.



Source link