Result

🎉 Congratulations to the competition winners!
🥇 1st Place - “HFLN”
🥈 2nd Place - “Z2A”
🥉 3rd Place - “FNRX”
Evaluation Leaderboard
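| Rank | Team Name | Total Prompt Score | Normalized Score |
|------|-----------|--------------------|------------------|
| 1 | HFLN | 0.0977 | 44.96 |
| 2 | Z2A | 0.0345 | 15.88 |
| 3 | FNRX | 0.0221 | 10.19 |
| 4 | ChatZero | 0.0161 | 7.42 |
| 5 | CARRASCO_VELO_Pablo | 0.0154 | 7.11 |
| 6 | MY_Team | 0.0121 | 5.57 |
| 7 | jaidh | 0.0088 | 4.04 |
| - | Strong Baseline (Zero-Shot Multi-turn) | 0.0084 | 3.87 |
| - | Weak Baseline (Zero-Shot) | 0.0021 | 0.96 |
| Dis | MyAwesomeTeam | - | - |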

"Dis" means disqualified.
MyAwesomeTeam was disqualified for not using our provided package, llms4pcg.chat_with_llm, to interact with the model.

Model-Specific Team Rankings
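| Rank | Team Name | Model | Prompt Score | Normalized Prompt Score |
|------|-----------|-------|--------------|-------------------------|
| 1 | HFLN | gemma 2 | 0.0387 | 17.80 |
| 2 | HFLN | qwen 2.5 | 0.0386 | 17.76 |
| 3 | HFLN | phi 3 | 0.0204 | 9.40 |
| 4 | Z2A | phi 3 | 0.0168 | 7.74 |
| 5 | FNRX | qwen 2.5 | 0.0141 | 6.47 |
| 6 | Z2A | qwen 2.5 | 0.0095 | 4.39 |
| 7 | Z2A | gemma 2 | 0.0081 | 3.75 |
| 8 | CARRASCO_VELO_Pablo | qwen 2.5 | 0.0079 | 3.62 |
| 9 | ChatZero | qwen 2.5 | 0.0077 | 3.56 |
| 10 | ChatZero | gemma 2 | 0.0057 | 2.62 |
| 11 | FNRX | gemma 2 | 0.0056 | 2.56 |
| 12 | CARRASCO_VELO_Pablo | gemma 2 | 0.0054 | 2.50 |
| 13 | MY_Team | gemma 2 | 0.0052 | 2.39 |
| 14 | MY_Team | qwen 2.5 | 0.0052 | 2.38 |
| 15 | Zero-Shot Multi-turn | qwen 2.5 | 0.0051 | 2.36 |
| 16 | jaidh | gemma 2 | 0.0050 | 2.29 |
| 17 | jaidh | qwen 2.5 | 0.0038 | 1.74 |
| 18 | Zero-Shot Multi-turn | gemma 2 | 0.0029 | 1.35 |
| 19 | ChatZero | phi 3 | 0.0027 | 1.24 |
| 20 | FNRX | phi 3 | 0.0025 | 1.16 |
| 21 | CARRASCO_VELO_Pablo | phi 3 | 0.0021 | 0.98 |
| 22 | MY_Team | phi 3 | 0.0017 | 0.80 |
| 23 | Zero-Shot | gemma 2 | 0.0010 | 0.45 |
| 24 | Zero-Shot | qwen 2.5 | 0.0009 | 0.43 |
| 25 | Zero-Shot Multi-turn | phi 3 | 0.0003 | 0.16 |
| 26 | Zero-Shot | phi 3 | 0.0002 | 0.08 |
| 27 | jaidh | phi 3 | 0.0000 | 0.00 |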
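
For readers cross-checking the two tables above, the short Python sketch below compares each team's Total Prompt Score with the sum of its three per-model Prompt Scores. The figures are copied from the published tables. The assumption that the total is the sum of the per-model scores is an observation from these numbers, not a documented scoring rule, and the names `PER_MODEL`, `TOTAL`, and `total_gaps` are illustrative only.

```python
# Per-model prompt scores from the Model-Specific Team Rankings,
# listed in the order: gemma 2, qwen 2.5, phi 3.
PER_MODEL = {
    "HFLN": [0.0387, 0.0386, 0.0204],
    "Z2A": [0.0081, 0.0095, 0.0168],
    "FNRX": [0.0056, 0.0141, 0.0025],
    "ChatZero": [0.0057, 0.0077, 0.0027],
    "CARRASCO_VELO_Pablo": [0.0054, 0.0079, 0.0021],
    "MY_Team": [0.0052, 0.0052, 0.0017],
    "jaidh": [0.0050, 0.0038, 0.0000],
    "Zero-Shot Multi-turn": [0.0029, 0.0051, 0.0003],
    "Zero-Shot": [0.0010, 0.0009, 0.0002],
}

# Total prompt scores from the Evaluation Leaderboard.
TOTAL = {
    "HFLN": 0.0977,
    "Z2A": 0.0345,
    "FNRX": 0.0221,
    "ChatZero": 0.0161,
    "CARRASCO_VELO_Pablo": 0.0154,
    "MY_Team": 0.0121,
    "jaidh": 0.0088,
    "Zero-Shot Multi-turn": 0.0084,
    "Zero-Shot": 0.0021,
}


def total_gaps(per_model, totals):
    """Return published total minus summed per-model scores, per team."""
    return {team: totals[team] - sum(scores) for team, scores in per_model.items()}


gaps = total_gaps(PER_MODEL, TOTAL)
for team, gap in gaps.items():
    # Every published value is rounded to 4 decimals, so small gaps are expected.
    print(f"{team:<22} gap = {gap:+.4f}")
```

Any nonzero gap here is small enough to be explained by the four-decimal rounding of the published values. The unrounded scores should be available in the downloadable CSV files.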
Evaluation Configuration

Parameters used during final evaluation:

  • Competition package: llms4pcg-python version 2.0.1
  • Starting seed for LLM interaction (Ollama): 6280
  • Random seed for programming package: 42
Download Competition Data

Competition-related data can be downloaded from the following link (August 29, 2025): OSF Storage

    • character_scores.csv
    • constants.json - Weights and constant values.
    • final_team_rankings.csv
    • prompt_scores_ranks.csv
    • trial_scores.csv
      • zero_shot_multi_turn.zip
      • zero_shot.zip
    • CARRASCO_VELO_PABLO.zip
    • ChatZero.zip
    • FNRX.zip
    • HFLN.zip
    • jaidh.zip
    • MyAwesomeTeam.zip
    • MY_Team.zip
    • Z2A.zip
  • competition.zip - Generated data from the evaluation process.