The 2nd ChatGPT4PCG Competition
Character-like Level Generation for Science Birds
Prompt Engineering Examples
We provide a few example programs implemented using various prompt engineering techniques. These examples are offered to assist you in getting started with the competition. You can utilize these examples as a foundation to develop your own advanced prompt engineering techniques. The examples are accessible in a GitHub repository, which you can find here.
Prompt Rules
- A program submitted to the competition must satisfy the following requirements:
- A submitted program must implement the interface through the Python package provided by the competition to interact with ChatGPT via API and output final results in a specified format. Participants are not allowed to interact with ChatGPT via API through other methods. This is to ensure fairness, as our provided interface will count and monitor the number of tokens used.
- For each evaluation, we will use the latest version of the competition's required Python package available as of one week before the submission deadline.
- Participants are responsible for providing the organizers with any tools needed to implement their prompt engineering techniques, along with instructions on how to set them up. We do not intend to provide any free or paid services required by your program, nor do we guarantee compatibility with additional tools it requires. If you use paid services, such as gated APIs, proprietary software, or specialized databases, it is your responsibility to provide any information required to run that software and to confirm with the providers that their licenses, terms, and agreements permit its use in this competition. Any parts of the software that cannot be made public must be explicitly identified to the organizers so that they can be removed before the submission is made publicly available. We recommend that participants check the specifications of the evaluation computer beforehand to ensure compatibility.
- The program must not modify responses from ChatGPT before writing them to files as final output for evaluation. We consider direct intervention to be cheating. Only the direct response from ChatGPT may be utilized as a final output.
- Modification of the message history of ChatGPT in a way that is considered cheating, such as altering the message history with manually created content (i.e., hard-coding answers), is prohibited.
- Modification of the token counter and timer used for the evaluation is prohibited.
- In the event of an error during a trial, that trial will be treated as producing an empty response.
- To ensure fairness, each program (source code, tools, databases, etc.) must be at most 1 GB in total size, including any data downloaded as a result of running the submitted program, and each trial is limited to 120 seconds. At most 25,000 tokens may be used per trial. The sampling temperature and random seed are always fixed at 1 and 42, respectively.
- Automatic prompt optimization may be utilized, but its use during the evaluation is discouraged as it quickly consumes available token limits. Therefore, we suggest employing these techniques beforehand.
- Programs that fail to meet the requirements listed under the first item above will be automatically disqualified.
- To ensure that code blocks can be extracted successfully from responses generated by the ChatGPT API, each output must include three backticks (`` ``` ``).
  - The code extraction script will only extract the content between the last pair of three backticks (`` ``` ``).
  - The extracted code must not contain any loops. Any use of loops will be ignored, resulting in only one instance of the loop's content. The code should not use variables in the call of the `drop_block()` function, as this will result in an error and that response will be skipped for the rest of the evaluation process. To check the behavior of the code extractor, please refer to the Resources page where you can find an online tool for this task.
  - If no code blocks are present or extracted code results in an empty string, the response will be skipped, and its score will be 0.
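The extraction rules above can be approximated locally before submitting. The following is a minimal sketch, not the official extractor: the function names `extract_last_code_block` and `find_rule_violations` are hypothetical, the handling of a language tag after the opening fence is an assumption, and the check assumes block types are passed as string literals such as `'b11'`. The online tool on the Resources page remains the authoritative reference.

```python
import ast

FENCE = "`" * 3  # three backticks, built this way so the example stays readable


def extract_last_code_block(response: str) -> str:
    """Return the text between the last pair of triple backticks, or "" if none."""
    parts = response.split(FENCE)
    # n fences split the text into n + 1 parts; a pair needs at least two fences.
    if len(parts) < 3:
        return ""
    body = parts[-2]  # text between the last two fences
    first_line, sep, rest = body.partition("\n")
    # Assumption: a bare word right after the opening fence is a language tag.
    if sep and first_line.strip().isalnum():
        body = rest
    return body.strip()


def find_rule_violations(code: str) -> list[str]:
    """List loops and non-literal drop_block() arguments found in the code."""
    problems = []
    for node in ast.walk(ast.parse(code)):
        if isinstance(node, (ast.For, ast.While)):
            problems.append(f"line {node.lineno}: loop")
        elif (
            isinstance(node, ast.Call)
            and isinstance(node.func, ast.Name)
            and node.func.id == "drop_block"
        ):
            args = node.args + [kw.value for kw in node.keywords]
            # Assumption: only plain literals count as valid arguments.
            if not all(isinstance(arg, ast.Constant) for arg in args):
                problems.append(
                    f"line {node.lineno}: non-literal argument in drop_block()"
                )
    return problems
```

Running both functions on a stored response before submission gives an early warning about outputs that would be skipped or truncated during evaluation.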
- The use of ChatGPT plugins is not supported, i.e., all plugins and function callings are disabled during the evaluation process.
- The final response from your program must explicitly include a series of `drop_block()` calls, which will be executed in that order by our tool to generate a character-like structure in a Science Birds level.
  - The definition of the `drop_block()` function is as follows:
    - It drops a block vertically from the top and centers it at a specific slot, denoted by `x_position`.
    - This function works on the following settings:
      - A structure is situated on a 2D grid with a width (`W`) of 20 columns and a height (`H`) of 16 rows. The grid consists of 320 cells, each of equal size.
      - Coordinates `(x, y)` are used to represent positions in the grid, where `x` and `y` denote the horizontal and vertical indices of cells, respectively. For example, `(0, 0)` denotes the bottom-left corner cell of the grid, and `(W-1, H-1)` is the top-right corner cell.
      - A cell on the grid has a size of 1x1. Each cell has unique `(x, y)` coordinates associated with it.
    - This function accepts two parameters:
      - `block_type`: a value that indicates the type of block to be placed. The possible values are `b11`, `b13`, and `b31`. An invalid block type will result in an error.
        - `b11` denotes a square block whose size is 1x1.
        - `b13` denotes a column block whose size is 1x3.
        - `b31` denotes a row block whose size is 3x1.
      - `x_position`: a horizontal index of a grid cell, where `0` represents the leftmost column of the grid, and `W-1` represents the rightmost column of the grid. The `x_position` parameter indicates the center pivot point of the block being placed. For example, if `b31` is the only block in the level and is placed at `x_position=4`, it will occupy cells `(3, 0)`, `(4, 0)`, and `(5, 0)`. An invalid position, such as one where the block would extend beyond the grid boundary, will result in an error.
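The grid and placement rules above can be illustrated with a small simulation. This is a sketch under stated assumptions rather than the competition's tool: the `Grid` class is hypothetical, a block is assumed to come to rest on the tallest occupied column beneath it with no physics applied, and exceeding the grid height is assumed to be an error in the same way as crossing the left or right boundary.

```python
W, H = 20, 16  # grid width and height, as given in the rules
# (width, height) in cells for each block type
BLOCK_SIZES = {"b11": (1, 1), "b13": (1, 3), "b31": (3, 1)}


class Grid:
    """Tracks, for each column, the height of the stack currently in it."""

    def __init__(self) -> None:
        self.heights = [0] * W

    def drop_block(self, block_type: str, x_position: int) -> list[tuple[int, int]]:
        """Drop a block centered at x_position and return the cells it occupies."""
        if block_type not in BLOCK_SIZES:
            raise ValueError(f"invalid block type: {block_type!r}")
        width, height = BLOCK_SIZES[block_type]
        left = x_position - width // 2  # x_position is the center pivot
        right = left + width - 1
        if left < 0 or right > W - 1:
            raise ValueError(f"{block_type} at x={x_position} crosses the grid boundary")
        # Assumption: the block rests on the tallest column under it.
        base = max(self.heights[left:right + 1])
        if base + height > H:
            raise ValueError(f"{block_type} at x={x_position} exceeds the grid height")
        cells = [
            (x, y)
            for y in range(base, base + height)
            for x in range(left, right + 1)
        ]
        for x in range(left, right + 1):
            self.heights[x] = base + height
        return cells


grid = Grid()
print(grid.drop_block("b31", 4))  # [(3, 0), (4, 0), (5, 0)]
print(grid.drop_block("b13", 4))  # [(4, 1), (4, 2), (4, 3)]
```

The first call reproduces the example from the rules: a `b31` block dropped at `x_position=4` on an empty grid occupies `(3, 0)`, `(4, 0)`, and `(5, 0)`. Calling `grid.drop_block("b31", 0)` raises a `ValueError` in this sketch, because the block would extend past the left edge of the grid.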