Update README.md
README.md
CHANGED
@@ -3,9 +3,7 @@ license: mit
 ---
 ## Overview
 
-
-
-As a result, the Ichigo-llama3s models offer enhanced robustness to noisy environmental inputs and improved handling of multi-turn conversations, ensuring greater reliability and performance in real-world applications.
+Developed by Menlo Research, AlphaMaze is a novel model designed to enhance and assess visual reasoning in large language models (LLMs). Unlike approaches that rely on complex image generation, AlphaMaze uses a surprisingly simple task: solving text-based mazes. This requires the LLM to internally reconstruct the maze, plan its path, and strategically reset after dead ends. To further improve AlphaMaze's capabilities, we utilize the GRPO (Group Relative Policy Optimization) method. The AlphaMaze model itself offers a richer, more nuanced assessment of spatial understanding than traditional multiple-choice tests.
 ## Variants
 
 | No | Variant | Cortex CLI command |