Pull Request Explorer
| Title | Author | Size | AI | Cycle Time | Review | Merged |
|---|---|---|---|---|---|---|
| fix laguna patch build breakage | dhiltgen | XS | No | 0.3h | 0.0h | Jun 02, 2026 |
SummaryBug FixFix laguna patch build breakage Resolves build failures for the Laguna patch, ensuring stable library exports. Health Assessment
Small
Low
Low
AI Details
Confidence:
0.95
Tech Stack |
||||||
| llm: ignore llama-server SSE ping comments | dhiltgen | XS | No | 0.1h | 0.0h | Jun 02, 2026 |
SummaryBug FixIgnore SSE ping comments in llama-server Fixes streaming behavior by skipping non‑data SSE comment frames, ensuring correct JSON parsing for completion and chat streams. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| docs: configure hermes desktop app | BruceMacD | S | No | 0.3h | 0.1h | Jun 02, 2026 |
SummaryDocsdocs: configure hermes desktop app Provides manual steps to set up the Hermes desktop app for use with Ollama, improving user onboarding. Health Assessment
Small
Low
Low
AI DetailsTech Stack |
||||||
| llama: add laguna (poolside) arch via a llama.cpp patch under llama/c… | dhiltgen | L | No | 25.8h | 25.8h | Jun 02, 2026 |
SummaryFeatureAdd Laguna architecture support via llama.cpp patch Adds support for the Laguna architecture to Ollama's llama.cpp integration, enabling new model variants and improving inference performance for certain workloads. Health Assessment
Large
Medium
Low
AI DetailsTech Stack
Languages:
Cpp
|
||||||
| launch: isolate Codex launch configuration | ParthSareen | XL | No | 0.3h | 0.3h | Jun 02, 2026 |
SummaryFeatureIsolate Codex launch configuration This PR creates a dedicated Codex CLI profile, isolates launch configuration, and adds cleanup for CLI‑owned files, improving maintainability and reducing configuration conflicts. Health Assessment
X-Large
Medium
Low
AI Details
Confidence:
0.80
Tech Stack
Languages:
Go
|
||||||
| llama.cpp version update | dhiltgen | XS | No | 3.1h | 0.5h | Jun 02, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| More harden app markdown URL handling | dhiltgen | S | No | 0.2h | 0.0h | Jun 02, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| llm: detect llama-server load stalls from output | dhiltgen | M | No | 2.3h | 2.3h | Jun 02, 2026 |
SummaryBug FixDetect llama-server load stalls Adds logic to monitor llama-server output to prevent timeouts during model loading, improving reliability. Health Assessment
Medium
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| discover: allow Radeon 8060S iGPU by default | dhiltgen | S | No | 1.4h | 1.2h | Jun 02, 2026 |
SummaryBug FixEnable Radeon 8060S iGPU by default Adds ROCm gfx1151 to allowlist, ensuring the Radeon 8060S integrated GPU is enabled without requiring OLLAMA_IGPU_ENABLE, improving GPU compatibility for users. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| Harden app markdown URL handling | dhiltgen | M | No | 44.7h | 44.6h | Jun 02, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| llm: include cached prompt tokens in llama-server counts | dhiltgen | M | No | 1.2h | 1.1h | Jun 02, 2026 |
SummaryBug FixAdd cached prompt token counts to llama-server Preserves legacy prompt evaluation metrics by combining cached and new prompt tokens, ensuring backward compatibility for Ollama users. This change maintains accurate token accounting for existing workflows. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: fix opencode local model limits | dhiltgen | S | No | 2.9h | 2.8h | Jun 02, 2026 |
SummaryBug FixFix OpenCode local model limits Adjust OpenCode behavior to omit limit stanzas for local models lacking output limits, restoring pre‑0.30 functionality. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| log template details to aid troubleshooting | dhiltgen | M | No | 0.9h | 0.7h | Jun 01, 2026 |
SummaryFeatureAdd detailed template selection logging Enables better troubleshooting by logging template selection details, improving debugging and reliability. Health Assessment
Small
Low
Low
AI Details
Confidence:
0.10
Tech Stack
Languages:
Go
|
||||||
| feat(launch): show and auto-install Cline CLI | hoyyeva | M | No | 1.3h | 0.8h | Jun 01, 2026 |
SummaryFeatureShow and auto-install Cline CLI Adds Cline CLI visibility in the launcher and auto‑install support via npm, with comprehensive tests. Health Assessment
Medium
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: migrate Codex config | ParthSareen | L | No | 1.1h | 1.0h | Jun 01, 2026 |
SummaryBug FixMigrate Codex configuration for launch Updates Codex configuration to improve compatibility and support local model metadata and prompt arguments. Health Assessment
Medium
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| cmd/launch: fix configure cline ollama provider via providers.json | hoyyeva | L | No | 74.3h | 72.6h | Jun 01, 2026 |
SummaryBug FixFix Cline Ollama provider configuration Ensures Cline CLI correctly configures the Ollama provider, enabling seamless local and cloud model usage while preserving legacy compatibility. Health Assessment
Large
High
High
AI Details
Confidence:
0.20
Tech Stack
Languages:
Go
|
||||||
| launch: avoid legacy Codex App profiles | skyloevil | L | No | 40.8h | 40.8h | Jun 01, 2026 |
SummaryBug FixAvoid legacy Codex App profiles Prevents launch failures by removing legacy profile settings, ensuring compatibility with newer Codex versions and maintaining user-selected provider configuration. Health Assessment
Medium
Low
Medium
AI DetailsTech Stack
Languages:
Go
|
||||||
| llama-server followups | dhiltgen | XL | No | 68.3h | 48.1h | Jun 01, 2026 |
SummaryBug FixUpdate llama server and CI for llama.cpp Adds multi‑GPU support, fixes embeddings API, updates CI for llama.cpp, and resolves imagegen loading bug, improving stability and build reliability. Health Assessment
X-Large
High
High
AI Details
Usage:
AI Reviewed
Category:
Code AI
Tools:
ChatGPT
Confidence:
0.95
Tech Stack
Languages:
Go, Yaml, Sh
|
||||||
| convert: fixes for qwen3next model conversion | pdevine | L | No | 67.1h | 0.7h | Jun 01, 2026 |
SummaryBug FixFix Qwen3Next model conversion issues This PR resolves conversion bugs for Qwen3Next models, ensuring correct tensor naming and quantization, improving model compatibility and performance. Health Assessment
Large
Medium
Low
AI Details
Confidence:
0.95
Tech Stack
Languages:
Go
|
||||||
| model: improvements to laguna-xs.2 parser/renderer | jmorganca | L | No | 21.2h | 0.1h | May 31, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| llama: handle Gemma 4 BOS override in llama server | jmorganca | M | No | 14.6h | 0.0h | May 31, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| mlx: fix dev mode search path | dhiltgen | S | No | 0.6h | 0.6h | May 29, 2026 |
SummaryBug FixFix MLX dev mode library search path Corrects library resolution paths after superbuild changes, ensuring MLX runs correctly in development mode. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| mlx: fix reported information in `ollama show` | pdevine | M | No | 0.4h | 0.3h | May 24, 2026 |
SummaryBug FixFix MLX show API information Corrects model metadata display, improving accuracy for users. Health Assessment
Medium
Low
Low
AI Details
Confidence:
0.10
Tech Stack
Languages:
Go
|
||||||
| server: remove duplicate template parsing | Joyel559 | S | No | 2.4h | 1.1h | May 24, 2026 |
SummaryRefactorRemove duplicate template parsing Eliminates redundant template parsing in server code, preserving behavior and adding focused unit tests. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| Revert "mlxrunner: add DFlash speculative decoding (#16134)" | jessegross | XL | No | 20.8h | 4.3h | May 22, 2026 |
SummaryBug FixRevert DFlash speculative decoding integration Reverts an invasive DFlash integration to keep the codebase clean and reduce complexity. Health Assessment
X-Large
Medium
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: enriched model inventory | ParthSareen | XL | No | 48.8h | 30.6h | May 21, 2026 |
SummaryFeatureAdd enriched model inventory to launch Enables richer model metadata for launch, reducing API calls and improving integration efficiency. Health Assessment
X-Large
High
Medium
AI Details
Confidence:
0.75
Tech Stack
Languages:
Go
|
||||||
| Reduce startup model hydration | dhiltgen | XL | No | 23.0h | 18.6h | May 19, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack
Languages:
Go
|
||||||
| codex: omit patch tool type | BruceMacD | XS | No | 1.7h | 1.6h | May 19, 2026 |
SummaryBug FixOmit patch tool type from codex Removing the patch tool type prevents schema compatibility issues, ensuring stable deployments. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| ci: speed up release builds | dhiltgen | L | No | 245.4h | 244.0h | May 15, 2026 |
SummaryCI/CDImprove release build speed and efficiency Optimizes CI release pipeline, reduces build times, improves caching, and lowers resource usage, boosting release velocity and developer experience. Health AssessmentAI Details
Confidence:
0.10
Tech Stack
Frameworks:
Docker, Cmake
|
||||||
| quiet down kv log spew | dhiltgen | XS | No | 92.1h | 91.9h | May 15, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| docs: add codex app docs | ParthSareen | M | No | 0.0h | - | May 15, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| mlxrunner: add DFlash speculative decoding | pdevine | XL | No | 18.9h | 18.9h | May 14, 2026 |
SummaryFeatureAdd DFlash speculative decoding to MLX runner Enables efficient speculative decoding for DFlash blocks, improving inference speed for Qwen3.6 and related models. Health Assessment
X-Large
Medium
Medium
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: codex app restarts | ParthSareen | L | No | 0.5h | 0.5h | May 14, 2026 |
SummaryBug FixImprove codex app restart reliability Enhances stability of codex app restarts, reducing brittleness and improving user experience. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: update codex app UI copy | ParthSareen | S | No | 0.4h | 0.4h | May 14, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| docs: hide codex app till launch | ParthSareen | XS | No | 0.1h | 0.1h | May 14, 2026 |
SummaryHealth AssessmentAI DetailsTech Stack |
||||||
| mlx: rework the MLX sampler | pdevine | XL | No | 24.3h | 18.4h | May 14, 2026 |
SummaryBug FixRework MLX sampler distribution pipeline Improves sampling efficiency and reproducibility by restructuring the MLX sampler pipeline, enabling sparse token handling and fixing seed behavior. Health Assessment
Large
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch: codex app integration | ParthSareen | XL | AI | 26.7h | 24.4h | May 14, 2026 |
SummaryFeatureAdd Codex App Launch Integration Enables users to launch and manage the Codex App directly from the platform, streamlining workflow and expanding integration options. Health Assessment
Large
High
Medium
AI Details
Usage:
Authored by AI
Category:
Code AI
Tools:
Claude
Confidence:
0.95
Tech Stack
Languages:
Go, Typescript
Frameworks:
React
|
||||||
| mlx: add memory trace logging | dhiltgen | XS | No | 0.9h | 0.9h | May 13, 2026 |
SummaryChoreAdd memory trace logging Adds memory trace logging to help diagnose issue #16030. Health Assessment
Small
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| launch/opencode: add image modalities for vision models | hoyyeva | M | No | 265.2h | 245.2h | May 12, 2026 |
SummaryFeatureAdd image modalities for vision models Adds image input capability metadata to OpenCode config for vision‑capable Ollama models, enabling better integration and user experience. Health Assessment
Medium
Medium
High
AI DetailsTech Stack
Languages:
Go
|
||||||
| anthropic: Preserve Claude local image-path tool results in renderer-owned prompt formatting | hoyyeva | XL | AI | 83.2h | 5.6h | May 12, 2026 |
SummaryBug FixPreserve Claude local image-path tool results Ensures images referenced by local paths are correctly handled and displayed in locally served models, improving user experience for image-based prompts. Health Assessment
X-Large
Medium
Medium
AI Details
Category:
Code AI
Confidence:
0.80
Tech Stack
Languages:
Go
|
||||||
| mlx: fix macOS 26 target leakage in v3 metallib | dhiltgen | S | No | 71.6h | 71.6h | May 11, 2026 |
SummaryBug FixFix macOS 26 target leakage in metallib Corrects deployment target leakage in Metal v3 library, ensuring proper macOS compatibility for MLX builds. Health Assessment
Small
Low
High
AI DetailsTech Stack
Languages:
Shell
|
||||||
| mlx: avoid status timeout during inference | dhiltgen | L | No | 24.5h | 24.5h | May 11, 2026 |
SummaryBug FixAvoid status timeout during inference Improves stability of the /v1/status endpoint by caching VRAM usage and preventing long-running inference from blocking health checks, ensuring reliable server health reporting. Health Assessment
Large
Medium
High
AI DetailsTech Stack
Languages:
Go
|
||||||
| mlx: update the imagegen runner for mlx thread affinity | pdevine | L | No | 12.9h | 10.7h | May 11, 2026 |
SummaryBug FixUpdate imagegen runner for mlx thread affinity Enhances image generation by applying mlx thread affinity, improving performance and reliability for users. Health Assessment
Large
Low
Low
AI Details
Confidence:
0.10
Tech Stack
Languages:
Go
|
||||||
| app: harden update flows | dhiltgen | XL | No | 3.4h | 3.1h | May 11, 2026 |
SummaryFeatureHarden update flows Adds opt‑in and unit tests to secure Windows and macOS update processes. Health Assessment
Large
Low
Low
AI DetailsTech Stack
Languages:
Go
|
||||||
| test: integration test hardening | dhiltgen | M | No | - | - | May 08, 2026 |
SummaryTestImprove integration test reliability Enhances test stability on slower systems, reducing flaky failures. Health Assessment
Small
Low
Low
AI Details
Confidence:
0.75
Tech Stack
Languages:
Go
|
||||||
| launch: disable Claude Desktop launch | ParthSareen | L | AI | 20.2h | 0.4h | May 07, 2026 |
SummaryFeatureDisable Claude Desktop launch Removes integration with Claude Desktop due to blocked third-party access for non-anthropic models. Health Assessment
Medium
Low
Low
AI Details
Usage:
AI Assisted
Category:
Code AI
Tools:
Claude
Confidence:
0.90
Tech Stack
Languages:
Go, Typescript, Javascript
Frameworks:
React
|
||||||
| launch: add plan-aware model gating | ParthSareen | XL | No | 0.6h | 0.6h | May 06, 2026 |
SummaryFeatureAdd plan-aware model gating to launch Adds plan-aware gating to ollama launch, providing sign‑in or upgrade hints for cloud models, improving user guidance. Health Assessment
X-Large
Low
Low
AI Details
Confidence:
0.10
Tech Stack
Languages:
Go
|
||||||
| cmd/launch: improve integration backup UX | hoyyeva | L | No | 139.6h | 96.8h | May 06, 2026 |
SummaryFeatureImprove integration backup UX Enhances backup handling for integration configs, keeping only recent backups and removing unnecessary prompts, which streamlines user experience and reduces disk clutter. Health Assessment
Medium
High
Medium
AI DetailsTech Stack
Languages:
Go
|
||||||
| server: cache show responses | ParthSareen | XL | No | 23.8h | 1.5h | May 05, 2026 |
SummaryFeatureAdd server-side cache for /api/show responses Improves model capability lookup speed by caching full responses, reducing latency for launch flows. Health Assessment
X-Large
Low
Low
AI Details
Confidence:
0.75
Tech Stack
Languages:
Go
|
||||||
| mlx: Gemma4 MTP speculative decoding | pdevine | XL | No | 1.2h | 0.3h | May 05, 2026 |
SummaryFeatureAdd Gemma4 MTP speculative decoding support Enables faster inference by allowing multi-token speculative decoding for Gemma4 models, improving performance and reducing latency for end users. Health Assessment
Large
Low
Low
AI DetailsTech Stack
Languages:
Go, Yaml, Sh
|
||||||