Yyuluo-yxchore: add license and newline

48c504a5创建于 2025年10月22日历史提交

文件	最后提交记录	最后更新时间
README.md	1224 yuluo/add ollama (#18) * feat: adjust module name Signed-off-by: yuluo-yx <yuluo08290126@gmail.com> * feat: add ollama chat example Signed-off-by: yuluo-yx <yuluo08290126@gmail.com> * ifx Signed-off-by: yuluo-yx <yuluo08290126@gmail.com> --------- Signed-off-by: yuluo-yx <yuluo08290126@gmail.com>	1 年前
docker-compose.yml	chore: add license and newline	7 个月前

部署 ollama3 并安装模型

部署中为了尽可能减少对本地环境的污染，使用 Docker 安装！

github: https://github.com/ollama/ollama

准备部署文件

version: '3.8'

services:
   ollama:
     volumes:
        # 如果需要可以打开：将本地文件夹挂载到容器中的 /root/.ollama 目录 （模型下载位置）
        - ./models:/root/.ollama  
     container_name: spring-ai-alibaba-ollama
     pull_policy: always
     tty: true
     restart: unless-stopped
     image: ollama/ollama:latest
     ports:
       - 11434:11434  # Ollama API 端口

之后在此根目录下使用 docker compose up -d 拉起容器，等待一段时间之后，docker images pull 成功。即可执行下一步。

下载 LLM 模型

LLM 模型参考：

Model	Parameters	Size	Download
Llama 3	8B	4.7GB	`ollama run llama3`
qwen	4b	2.3G	`ollama run qwen:4b`
Llama 3	70B	40GB	`ollama run llama3:70b`
Phi-3	3,8B	2.3GB	`ollama run phi3`
Mistral	7B	4.1GB	`ollama run mistral`
Neural Chat	7B	4.1GB	`ollama run neural-chat`
Starling	7B	4.1GB	`ollama run starling-lm`
Code Llama	7B	3.8GB	`ollama run codellama`
Llama 2 Uncensored	7B	3.8GB	`ollama run llama2-uncensored`
LLaVA	7B	4.5GB	`ollama run llava`
Gemma	2B	1.4GB	`ollama run gemma:2b`
Gemma	7B	4.8GB	`ollama run gemma:7b`
Solar	10.7B	6.1GB	`ollama run solar`

这里选择最小体积且最好用的模型： llama3:4b 模型。

docker exec -it spring-ai-alibaba-ollama bash

# 进入容器执行
ollama run llama3

成功之后会看到下面这样：

root@c5e5ff20a533:/# ollama run llama3
pulling manifest 
pulling 6a0746a1ec1a... 100% ▕██████████████████████████████████████████████████████████████████████████████████████████████████████████▏ 4.7 GB                         
pulling 4fa551d4f938... 100% ▕██████████████████████████████████████████████████████████████████████████████████████████████████████████▏  12 KB                         
pulling 8ab4849b038c... 100% ▕██████████████████████████████████████████████████████████████████████████████████████████████████████████▏  254 B                         
pulling 577073ffcc6c... 100% ▕██████████████████████████████████████████████████████████████████████████████████████████████████████████▏  110 B                         
pulling 3f8eb4da87fa... 100% ▕██████████████████████████████████████████████████████████████████████████████████████████████████████████▏  485 B                         
verifying sha256 digest 
writing manifest 
removing any unused layers 
success 
>>> 你好
💖 你好！我很高兴地看到你的消息！ 😊

>>> 你能介绍下自己吗
😊 I'd be happy to introduce myself.

My name is LLaMA, and I'm a large language model trained by Meta AI. I'm a computer program designed to understand and generate human-like text, so we can have 
conversations like this one! 🤖

I was trained on a massive dataset of text from the internet, which allows me to learn about various topics, including history, science, culture, and more. This 
training enables me to answer questions, provide information, and even engage in creative writing or storytelling.

As a conversational AI, my goal is to assist and entertain users like you. I'm designed to be helpful, friendly, and respectful, so please feel free to ask me 
anything or share your thoughts with me! 💬

访问

上面已经介绍了一种访问方式，通过 run 的方式。下面介绍 api 方式访问。

API

curl http://localhost:11434/api/generate -d '{
    "model":"llama3",
    "prompt": "请分别翻译成中文、韩文、日文 -> Meta Llama 3: The most capable openly available LLM to date",
    "stream": false
}'

curl http://localhost:11434/api/chat -d '{
  "model": "llama3",
  "messages": [
    {
      "role": "user",
      "content": "why is the sky blue?"
    }
  ],
  "stream": true
}'