| Last Updated | Changes |
| --- | --- |
| 12/31/2024 | First Version |
Large Language Models (LLMs)
What Are LLMs?
Large Language Models (LLMs) are a class of artificial intelligence systems designed to understand and generate human-like text. They are powered by advanced neural networks trained on massive datasets, often consisting of books, articles, and online content, allowing them to grasp patterns, context, and nuances in language. At their core, LLMs are built using a type of machine learning architecture called Transformers, which enables them to process and generate text in a highly contextual and coherent manner.
LLMs excel at a wide range of tasks, including answering questions, summarizing information, generating creative content, translating languages, and even coding. They work by predicting the most likely sequence of words based on a given input, which allows them to produce fluent and contextually relevant responses.
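To make the "predicting the most likely next word" idea concrete, here is a minimal sketch of text generation using the Hugging Face transformers library (one convenient option, not something this guide otherwise depends on); GPT-2 is used purely because it is a tiny, freely downloadable model:

```python
# A minimal sketch: generating text with a small open model via Hugging Face's
# `transformers` library. GPT-2 is used only because it is tiny and free to
# download; any text-generation model on the Hub could be swapped in.
from transformers import pipeline

# Load a text-generation pipeline (this downloads the model on first run).
generator = pipeline("text-generation", model="gpt2")

# The model continues the prompt by repeatedly predicting the most likely
# next token, which is all "text generation" means at this level.
result = generator(
    "Large Language Models are useful because",
    max_new_tokens=40,       # how many tokens to generate beyond the prompt
    num_return_sequences=1,  # just one continuation
)

print(result[0]["generated_text"])
```

The first run downloads the model weights; after that, everything happens on your own machine.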
LLMs are not just tools for experts – they’re accessible to anyone interested in enhancing their day-to-day productivity, or exploring their creative potential. Whether you’re a developer looking to integrate AI into your coding projects or simply curious about their capabilities, this guide will provide a very quick and practical introduction to get you started, without getting too technical!
Check out our LLM Glossary for more information on some of the terminology you’ll come across in this Guide!
Key Applications and Benefits
LLMs are versatile tools that have found applications across many facets of everyday life, making them indispensable for both personal and professional use. Here are just a few of their applications and the benefits they bring:
Content Creation
- Assisting in creative writing, proofreading, drafting emails, and cover letters.
Education
- Explaining complex topics in simple terms.
- Generating study guides or summaries of lengthy material.
- Searching the web.
Programming Assistance
- Writing, debugging, and optimizing code in a number of common (and not so common!) languages.
- Explaining coding concepts, or describing the function of code snippets.
- Automating repetitive programming tasks.
Customer Support
- Powering chatbots and virtual assistants! Try out our own GPT-4-powered EduBot!
- Providing 24/7 helpdesk support (see CivBot on Civitai.com!)
- Personalizing customer interactions with natural, conversational responses.
Entertainment
- Creating narratives, character backstories, or lore.
Popular LLM Models
LLMs come in two main types: closed-source and open-source. Closed-source LLMs are proprietary systems developed by organizations that keep their code and data confidential. These models often require substantial computational resources, making them impractical to run on standard consumer hardware; instead, users access them via cloud services or APIs (see OpenAI’s ChatGPT, or X/Twitter’s Grok). In contrast, open-source LLMs have publicly available code and, in some cases, model weights, allowing developers and casual users to run and modify them locally. Many open-source models are optimized for efficiency, enabling users to run them on consumer-grade computer hardware.
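To give a feel for what "access via APIs" means in practice, here is a rough sketch of a chat request to a hosted model using OpenAI's official Python client; it assumes the openai package is installed, an OPENAI_API_KEY environment variable is set, and the model name shown is just an illustrative choice:

```python
# A rough sketch of calling a hosted (closed-source) LLM over an API,
# using OpenAI's official Python client as one example.
# Assumes `pip install openai` and that OPENAI_API_KEY is set in the environment.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",  # example model name; use whichever model your plan offers
    messages=[
        {"role": "system", "content": "You are a concise, helpful assistant."},
        {"role": "user", "content": "Summarize what a Large Language Model is in two sentences."},
    ],
)

print(response.choices[0].message.content)
```

Many open-source models can be called in much the same way through tools that expose compatible local endpoints; see "Running LLMs Locally" below.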
Here’s an overview of some prominent Closed and Open-Source LLMs:
- ChatGPT (Generative Pre-trained Transformer): Developed by OpenAI, GPT models are renowned for their advanced language processing capabilities, excelling in tasks like text generation, translation, and summarization. The latest iteration, GPT-4o, offers enhanced reasoning, understanding, and advanced voice features.
- Grok: Introduced by xAI for the X (Twitter) platform, Grok is designed to provide real-time information access and advanced reasoning, aiming to compete with leading AI models.
- GitHub Copilot: A collaboration between GitHub and OpenAI, Copilot serves as an AI-powered coding assistant. It integrates seamlessly into popular code editors, offering real-time code suggestions and helping developers by autocompleting code and providing context-aware recommendations.
- Gemini: Developed by Google DeepMind, Gemini is a multimodal AI model capable of processing and generating content across various formats, including text, images, audio, and video.
- Microsoft Copilot: Integrated into Microsoft’s suite of products, including Office applications like Word and Excel, Copilot assists users by generating content, automating tasks, and providing intelligent suggestions to enhance productivity.
- Llama (Large Language Model Meta AI): An open-source LLM developed by Meta (Facebook), Llama is designed to be efficient and accessible, enabling researchers and developers to build upon its architecture for various applications. The latest version, Llama 3.3, challenges the dominance of closed-source alternatives.
- Mistral 7B: A compact yet powerful open-source LLM with 7 billion parameters, Mistral 7B is optimized for efficiency, making it suitable for a range of applications without requiring extensive computational resources.
LLMs, Ranked
With so many LLMs out there, figuring out which one suits your needs can be a bit of a challenge. That’s where the Chatbot Arena comes in. Developed by researchers at UC Berkeley’s SkyLab, it’s an open-source platform that lets users pit two anonymous LLMs against each other by entering a prompt and comparing their responses. Users then vote for the one they prefer, and these votes feed a live leaderboard ranking the various models.
Running LLMs Locally
Running an LLM locally has a lot of perks, especially if you care about privacy, customization, or having full control. It keeps your data on your own device, so you’re not sending sensitive info off to a cloud server. Plus, you can tweak the model to do exactly what you need – like training it on your own data for specific tasks.
Hardware Requirements
While the big “triple A” models might require top-of-the-line GPUs, there are smaller, optimized models like LLaMA, Alpaca, or Mistral that can deliver great performance on lower-end systems.
Some of these can even run on a standard laptop or a modest desktop PC with as little as 4 to 8GB of VRAM. These smaller models are designed to be efficient while still maintaining impressive capabilities for tasks like text generation, summarization, or coding help. This makes local LLMs accessible to hobbyists, developers, and researchers who want to dive into the world of LLMs.
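As a rough illustration of that efficiency, here is one way a 4-bit quantized 7B model might be run locally with the llama-cpp-python library (our choice of library for the sketch; the tools in the next section wrap this kind of thing in a friendlier interface). The GGUF filename below is a placeholder for whichever quantized model you download:

```python
# One possible way to run a small quantized model locally with llama-cpp-python
# (`pip install llama-cpp-python`). The GGUF file below is a placeholder name;
# download any quantized 7B model (e.g. from HuggingFace) and point model_path at it.
from llama_cpp import Llama

llm = Llama(
    model_path="./mistral-7b-instruct.Q4_K_M.gguf",  # placeholder path to your downloaded model
    n_ctx=2048,        # context window size
    n_gpu_layers=-1,   # offload all layers to the GPU if available; use 0 for CPU-only
)

# Chat-style completion against the local model; no data leaves your machine.
result = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain what VRAM is in one short paragraph."}],
    max_tokens=128,
)

print(result["choices"][0]["message"]["content"])
```

A 4-bit ("Q4") 7B file is usually around 4 to 5GB on disk, which is why these models fit comfortably within the 4 to 8GB VRAM figure mentioned above.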
Some Easy Options to Run Local LLMs
- NVIDIA’s ChatRTX: NVIDIA’s ChatRTX is a free application that lets users run a personalized AI chatbot directly on local hardware. By connecting to documents, notes, images, and other data, it provides contextually relevant answers without relying on cloud services, ensuring data stays private. To use ChatRTX, you’ll need a Windows 11 system and an NVIDIA GeForce RTX 30 or 40 Series GPU.
- “Oobabooga” Text Generation WebUI: Oobabooga’s Text Generation WebUI is an open-source Gradio-based interface (much like the AUTOMATIC1111 WebUI) that allows users to run LLMs directly from their browser. It supports a huge variety of models, which can be imported directly from HuggingFace, and offers features like chat and notebook modes, model switching, and extensions for text-to-speech and translation.
- LM Studio: LM Studio is a free desktop application for Windows, macOS, and Linux, designed for offline LLM use. It supports a wide range of models (GGUF builds of Llama, Mistral, Phi, Gemma, StarCoder, and more), offers direct HuggingFace import, lets you chat with your own documents, and can run a local OpenAI-compatible server (see the sketch after this list).
- Anything LLM: Anything LLM is an open-source framework by Mintplex Labs that provides an extensible, user-friendly environment for deploying and managing LLMs locally within a web-based interface. It’s designed to empower enthusiasts to tailor AI models without relying on cloud-based services.
- Jan: Jan is a super-simple open-source app for LLM use, designed to run offline on a wide range of architectures and configurations (NVIDIA GPUs, Apple M-series, Apple Intel, Linux Debian, Windows x64).
- GPT4ALL: GPT4ALL from Nomic offers a super-simple, lightweight application for Windows, macOS, and Ubuntu with all the popular LLM features: offline chat, an open-source model library, and the ability to query your own data, all packaged within a nice, clean interface.
- Backyard.ai: Backyard AI is a web-based service (with a locally running desktop application) dedicated to creating, sharing, and chatting with LLM-powered Characters. It has a library of thousands of community-defined Characters, plus advanced features like text-to-speech “voice calling” with your creations.
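Several of the tools above (LM Studio, for instance) can expose a local, OpenAI-compatible server, so the same client code used for cloud APIs can simply point at your own machine. Here is a minimal sketch, assuming LM Studio's server is running on its default port (1234) and a model is already loaded in the app:

```python
# A minimal sketch of talking to a locally hosted, OpenAI-compatible server.
# LM Studio serves one at http://localhost:1234/v1 by default; other tools offer
# similar endpoints. Assumes a model is already downloaded and loaded locally.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:1234/v1",  # the local server instead of a cloud endpoint
    api_key="not-needed-locally",         # placeholder; local servers typically ignore it
)

response = client.chat.completions.create(
    model="local-model",  # use whatever identifier your local server reports for the loaded model
    messages=[{"role": "user", "content": "Write a short haiku about running AI offline."}],
)

print(response.choices[0].message.content)
```

Because the local endpoint speaks the same protocol as the cloud APIs, switching between hosted and local models is often just a matter of changing the base_url and the model name.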
Using LLMs Effectively
Writing Good Prompts
Section to follow!
Avoiding Common Pitfalls
Section to follow!
I need more help!
If you’re having trouble setting up an LLM for local use, or have any Generative AI questions, please join the Civitai Discord and seek assistance in the #ai-help channel!