Chewy-Lemon-Cookie-11B

This is a merge of pre-trained language models created using mergekit.

GGUF quants:

Merge Details

Merge Method

This model was merged using the following methods:

passthrough
task arithmetic
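
As a rough illustration of the second method: task arithmetic (described in "Editing Models with Task Arithmetic", arXiv:2212.04089) adds weighted task vectors, each being the difference between a fine-tuned model and a shared base, onto the base weights. The sketch below uses plain Python lists as stand-in tensors. The function name `task_arithmetic` and the toy values are hypothetical, and mergekit's real implementation works on full checkpoints and may apply further options, so treat this as a picture of the formula only.

```python
def task_arithmetic(base, models, weights):
    """Return base + sum_i w_i * (model_i - base), computed element-wise."""
    merged = list(base)
    for model, w in zip(models, weights):
        for j, (m, b) in enumerate(zip(model, base)):
            # (m - b) is the task vector entry; it is zero when model is the base
            merged[j] += w * (m - b)
    return merged


# Toy stand-ins for weight tensors (hypothetical values, not real weights)
base = [1.0, 2.0, 3.0]
finetuned = [2.0, 2.0, 1.0]

# The base appears in the model list with its own weight; its task vector
# is all zeros, so under this formula only the fine-tuned model shifts the result.
merged = task_arithmetic(base, [base, finetuned], [0.85, 0.15])
print(merged)
```

Under this formula the weight given to the base model itself has no effect, since its task vector is zero; only the other model's difference from the base is scaled and added.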

Models Merged

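The following models were included in the merge:

SanjiWatsuki/Kunoichi-7B
SanjiWatsuki/Silicon-Maid-7B
KatyTheCutie/LemonadeRP-4.5.3
Sao10K/Fimbulvetr-11B-v2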

Configuration

The following YAML configurations were used to produce this model:

slices:
  - sources:
    - model: SanjiWatsuki/Kunoichi-7B
      layer_range: [0, 24]
  - sources:
    - model: SanjiWatsuki/Silicon-Maid-7B
      layer_range: [8, 24]
  - sources:
    - model: KatyTheCutie/LemonadeRP-4.5.3
      layer_range: [24, 32]
merge_method: passthrough
dtype: bfloat16
name: Big-Lemon-Cookie-11B-BF16

---

models:
  - model: Big-Lemon-Cookie-11B-BF16
    parameters:
      weight: 0.85
  - model: Sao10K/Fimbulvetr-11B-v2
    parameters:
      weight: 0.15
merge_method: task_arithmetic
base_model: Big-Lemon-Cookie-11B-BF16
dtype: bfloat16
name: Chewy-Lemon-Cookie-11B
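
To see why stacking slices of 7B models gives roughly 11B parameters, the following sketch counts the layers in the passthrough configuration above and estimates the parameter total. It assumes `layer_range` is half-open (start inclusive, end exclusive) and that all three source models share the standard Mistral-7B shape (hidden size 4096, MLP size 14336, 1024-wide key/value projections, 32000-token vocabulary, untied output head). Those hyperparameters are assumptions not stated in this card, and the helper names are hypothetical.

```python
# Slices copied from the passthrough configuration: (model, start, end)
SLICES = [
    ("SanjiWatsuki/Kunoichi-7B", 0, 24),
    ("SanjiWatsuki/Silicon-Maid-7B", 8, 24),
    ("KatyTheCutie/LemonadeRP-4.5.3", 24, 32),
]

# ASSUMPTION: Mistral-7B architecture hyperparameters (not given in the card)
HIDDEN = 4096
INTERMEDIATE = 14336
KV_DIM = 1024
VOCAB = 32000


def stacked_layers(slices):
    """Total layer count, treating each range as half-open [start, end)."""
    return sum(end - start for _, start, end in slices)


def params_per_layer():
    """Parameters in one decoder layer under the assumed shape."""
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM  # q, o and k, v
    mlp = 3 * HIDDEN * INTERMEDIATE                        # gate, up, down
    norms = 2 * HIDDEN                                     # two RMSNorm weights
    return attention + mlp + norms


def total_params(n_layers):
    """Decoder layers plus input embedding, output head and final norm."""
    embeddings = 2 * VOCAB * HIDDEN
    final_norm = HIDDEN
    return n_layers * params_per_layer() + embeddings + final_norm


n_layers = stacked_layers(SLICES)
print(n_layers)                                  # 48
print(round(total_params(n_layers) / 1e9, 2))    # 10.73
```

Under these assumptions the stack has 24 + 16 + 8 = 48 layers and about 10.7B parameters, which is consistent with the "11B" in the model name.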

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	21.91
IFEval (0-Shot)	48.75
BBH (3-Shot)	33.01
MATH Lvl 5 (4-Shot)	4.61
GPQA (0-shot)	3.91
MuSR (0-shot)	15.95
MMLU-PRO (5-shot)	25.19
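
As a quick sanity check, the reported average looks like the unweighted mean of the six benchmark scores; this is an assumption about how the leaderboard computes it. The sketch below recomputes the mean from the rounded values in the table.

```python
# Scores copied from the table above
scores = {
    "IFEval (0-Shot)": 48.75,
    "BBH (3-Shot)": 33.01,
    "MATH Lvl 5 (4-Shot)": 4.61,
    "GPQA (0-shot)": 3.91,
    "MuSR (0-shot)": 15.95,
    "MMLU-PRO (5-shot)": 25.19,
}

# Unweighted mean over the six benchmarks
mean = sum(scores.values()) / len(scores)
print(f"{mean:.2f}")  # 21.90
```

The recomputed value is within 0.01 of the reported 21.91. The small gap is presumably because the table shows each score already rounded to two decimals.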

Safetensors

Model size: 11B params
Tensor type: BF16

Model tree for FallenMerick/Chewy-Lemon-Cookie-11B

Merged from:
KatyTheCutie/LemonadeRP-4.5.3
SanjiWatsuki/Kunoichi-7B
SanjiWatsuki/Silicon-Maid-7B
Sao10K/Fimbulvetr-11B-v2

Paper for FallenMerick/Chewy-Lemon-Cookie-11B

Editing Models with Task Arithmetic (arXiv:2212.04089, published Dec 8, 2022)

Evaluation results (Open LLM Leaderboard)

Metric type	Benchmark	Value
strict accuracy	IFEval (0-Shot)	48.750
normalized accuracy	BBH (3-Shot)	33.010
exact match	MATH Lvl 5 (4-Shot)	4.610
acc_norm	GPQA (0-shot)	3.910
acc_norm	MuSR (0-shot)	15.950
accuracy	MMLU-PRO (5-shot), test set	25.190