References


Related references

Google

Colab

pdf(s)

OLMo: Accelerating the Science of Language Models

X

ariG23498 / A simple hack to calculating how much VRAM you would need to run a model.
mervenoyann / NVIDIA just dropped a gigantic multimodal model called NVLM 72B 🦖
danielhanchen / Llama 3.2 Multimodal benchmarks
ArtificialAnlys / OpenAI’s o1-preview is the first model to substantially push the frontier of language model intelligence since the original GPT-4 over 18 months ago
helloiamleonie / Chunking techniques for RAG:
akshay_pachaar / Multimodal Retrieval Augmented Generation (RAG), clearly explained:
weaviate_io / 6 types of vector embeddings for your AI applications

NVIDIA

NVIDIA Releases Open Synthetic Data Generation Pipeline for Training Large Language Models

Medium

velog

Substack

Brunch

inblog

graphwoody / From RAG to GraphRAG, What is the GraphRAG and why I use it?

Replicate

A comprehensive guide to running Llama 2 locally

MetaAI

Google

Colab

Ollama

Linkedin

How to Fit Large Language Models in Small Memory: Quantization

Finbarr Timbers

How is LLaMa.cpp possible?

fast.ai

Can LLMs learn from a single example?

replit

I'm not a programmer, and I used AI to build my first bot

Second State

Fast and Portable Llama2 Inference on the Heterogeneous Edge

Bbycroft

LLM Visualization

Justine

Bash One-Liners for LLMs
- jart/rename-pictures.sh

moomou

Listening with LLM

Trail of Bits

LeftoverLocals: Listening to LLM responses through leaked GPU local memory

2MB codes

cmb/ollama-bot: This is a rudimentary IRC bot that communicates with a local instance of ollama.

Shyam's Blog

Beyond Self-Attention: How a Small Language Model Predicts the Next Token

Gradient Descent into Madness

LLM from scratch: Automatic Differentiation

Hamel

Fuck You, Show Me The Prompt.

Ahead of AI

kapa.ai

Optimizing Technical Docs for LLMs

Cloudflare

Cloudflare announces Firewall for AI

Martin Lumiste

Compressing images with neural networks

Brandon's Digital Garden

Building an email to calendar LLM

Justine Tunney's Web Page

LLaMA Now Goes Faster on CPUs

hiddenest

I Don't Know LLMs, but I Wanted to Build Something (llm은 모르지만 뭔가 만들고 싶어서)

tistory

Real Python

Jeong Wooil's Blog (정우일 블로그)

Running an LLM Locally with a GGUF File (GGUF 파일로 로컬에서 LLM 실행하기)

Daddy Maker

Outerbounds

The Many Ways to Deploy a Model

ChristopherGS

Running Open Source LLMs In Python - A Practical Guide

Simon Willison's TILs

Allen Pike

LLMs Aren’t Just “Trained On the Internet” Anymore

Frozen Coder's Tech Blog (냉동코더의 기술블로그)

Using Ollama on macOS (macOS에서 Ollama 사용하기)

Applied LLMs

What We’ve Learned From A Year of Building with LLMs

OranLooney.com

A Picture is Worth 170 Tokens: How Does GPT-4o Encode Images?

lytix.ai

Cost Of Self Hosting Llama-3 8B-Instruct

`llama.ttf`

llama.ttf

Eugene Yan

Patterns for Building LLM-based Systems & Products

Alex Strick van Linschoten

How to think about creating a dataset for LLM finetuning evaluation

imbue

From bare metal to a 70B model: infrastructure set-up and scripts

Jay Alammar

The Illustrated Transformer

The Missing Notes

42dot - We Are A Mobility AI Company

42dot LLM 1.3B

Haandol, TL;DR

Business, Technology, Leadership - CIO Korea

Global IT News for Technology Leaders - ITWorld Korea

huffon / What if...? If I Were to Build an LLM Application Again from Scratch (처음부터 다시 LLM 어플리케이션을 개발한다면)

Dable Tech Blog

Improving the Fine-Tuning Performance of Language Models (언어 모델의 Fine-Tuning 성능 올리기)

Augmend

TreeSeg: Hierarchical Topic Segmentation at Augmend

Min Hsu

Scheduling Model in LLVM - Part I

Chip Huyen

Building A Generative AI Platform

Codesolvent Blog

Declarative Programming With AI/LLMs

Yoon Sup Choi's Digital Healthcare (최윤섭의 디지털 헬스케어)

Let's Regulate Generative Medical AI as a "New Intelligent Being" (생성형 의료 인공지능을 ‘새로운 지적 존재’로서 규제하자)

Timescale Blog

RAG Is More Than Just Vector Search

000namc.blog

Sequoia Capital

Generative AI’s Act o1 - The Agentic Reasoning Era Begins

000namc.xyz

TensorMSA

Jason Kang

What Is Direct Parameter Optimization(DPO)?

Pega Devlog

DSChloe