quietlight/skraak_mcp - Change SQZWTGAV6XDIFU42C5KIBHF5GIGPTXB6552Q7FXQCGHYPGQLFB5QC

deleted selection_metadata table and replaced it with label_metadata table

Created by quietlight on February 24, 2026

SQZWTGAV6XDIFU42C5KIBHF5GIGPTXB6552Q7FXQCGHYPGQLFB5QC

Dependencies

In channels

main

Change contents

Replacement in tools/export.go at line 55 [3.1]

B:BD[3.2257] → [3.2257:2362]

	{Table: "selection_metadata", Relation: "owned-via", FilterCol: "selection_id", ViaTable: "selection"},

[3.2257]

[3.2362]

	{Table: "label_metadata", Relation: "owned-via", FilterCol: "label_id", ViaTable: "label"},

Replacement in resources/schema.go at line 25 [4.114683]
B:BD[4.115030] → [4.115030:115053]
```
	"selection_metadata",
```
[4.115030]
[4.115053]
```
	"label_metadata",
```

Replacement in resources/schema.go at line 55 [4.114683]

B:BD[4.115877] → [5.1226:1495]

		Description: "SQL schema for a specific table. Available tables: dataset, location, cyclic_recording_pattern, cluster, file, moth_metadata, file_metadata, file_dataset, selection, selection_metadata, ebird_taxonomy, species, call_type, filter, label, label_subtype",

[4.115877]

[4.116185]

		Description: "SQL schema for a specific table. Available tables: dataset, location, cyclic_recording_pattern, cluster, file, moth_metadata, file_metadata, file_dataset, selection, label_metadata, ebird_taxonomy, species, call_type, filter, label, label_subtype",

Deletion in db/schema.sql at line 133 [4.305331]

∅:D[6.148502] → [4.314267:314271]

B:BD[4.314267] → [4.314267:314271]

B:BD[4.314444] → [4.314444:314520]

B:BD[4.314520] → [6.148503:148519]

∅:D[6.148519] → [4.314595:314821]

B:BD[4.314595] → [4.314595:314821]

);
CREATE TABLE selection_metadata (
    selection_id VARCHAR(21) PRIMARY KEY,
    json JSON, 
    created_at TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP,
    last_modified TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP,
    active BOOLEAN DEFAULT TRUE,
    FOREIGN KEY (selection_id) REFERENCES selection(id)

Replacement in db/schema.sql at line 188 [4.305331]

∅:D[6.148868] → [4.317577:317604]

B:BD[4.317577] → [4.317577:317604]

    filter_id VARCHAR(12),

[6.148868]

[4.317604]

    filter_id VARCHAR(12) NOT NULL, -- Note, not null constraint is not enforced by db yet but will be next time it is exported and reimported

Insertion in db/schema.sql at line 196 [4.305331]

[4.318015]

);
CREATE TABLE label_metadata (
 label_id VARCHAR(21) PRIMARY KEY,
 json JSON,
 created_at TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP,
 last_modified TIMESTAMP WITH TIME ZONE DEFAULT CURRENT_TIMESTAMP,
 active BOOLEAN DEFAULT TRUE,
 FOREIGN KEY (label_id) REFERENCES label(id)

File addition: avianz_file_format_specification.md (----------)

[4.161589]

# Specification of file formats used by AviaNZ
AviaNZ annotations and filter definitions are stored in JSON format to allow easy parsing and manual inspection by text editors.
## Annotation files (.data)
A JSON array where the first (optional, but recommended) element stores metadata about the corresponding audio file, and each remaining element corresponds to a segment:
    [ Meta, seg, seg, seg, seg ... ]
`Meta`: a JSON object (key-value pairs) containing any metadata. Required fields:  
`Operator` - string  
`Reviewer` - string  
`Duration` - numeric, audio file length, in seconds  
...
Each true segment `seg` is a JSON array containing five elements, all required:
    [ starttime, endtime, freq.low, freq.high, labels ]
    
`startime, endtime` - segment start and end positions, in seconds, relative to start of file as 0.  
`freq.low, freq.high` - for annotation boxes, frequency band in Hz. For segments (full-band annotations), both `0`. If both `0<freq<1`, old format is assumed, and treated as full-band segment (`0,0`).  
`labels` - a JSON array of labels for each type of sound detected:
    [ label, label, label... ]
    
where each `label` is a JSON object, having some of the following fields:
    { "species": "Kiwi (Little spotted)", "certainty": 0, "filter": "kiwi-best", "calltype": "f1", ... }
    
`species` - string, either `"genus (species)"` or just plain `"species"`. May be `"Don't Know"` or any other label (`"Bellbird/Tui"`, `"Fantail (spp)"`...), except for the internal genus separator `>`. Required.  
`certainty` - numeric between 0 and 100. Currently, for `"species": "Don't Know"` only `0` allowed, `100` corresponds to green segments, and `50` corresponds to question marks in earlier formats. `(species, certainty)` defines a unique key for labels. Required.  
`filter` - string, name of the filter file that created this label, or `"M"` for manual annotations.  
`calltype` - string, to identify the call type. Call types can be annotated manually, or will be automatically generated from clusters during filter training. Required for automatic filters (i.e. if `filter` is not empty or `"M"`).  
Any additional attributes defined for this call (male/female, subjective loudness...) are optional and can be passed as key-value pairs.
Thus, a full .data file may look like this:
    [ {"Operator": Alice, "Reviewer": Bob, "Duration": 60.0, "Noise": "windy"},    // metadata
      // a manually marked box
      [1.0, 19.0, 1200, 2500,
        [
          { "species": "Kiwi (Little spotted)", "certainty": 100, "filter": "M", "loudness": 3 }
        ]
      ],
      // box from a "trill" filter
      [21.0, 23.0, 800, 6000,
        [
          { "species": "Morepork", "certainty": 50, "filter": "ruru-90-10", "calltype": "trill" }
        ]
      ],
      // a manually marked segment with morepork and something else
      [35, 45, 0, 0,
        [
          { "species": "Morepork", "certainty": 100, "filter": "M" },
          { "species": "Don't Know", "certainty": 0, "filter": "M" }
        ]
      ]
    ]
## Filter files (.txt)
A JSON array:
    { "species": "Kiwi (Little spotted)", "SampleRate": 16000, "Filters": [], "NN": {}, ...}
    
Main filter ID is the file name because this automatically ensures that no duplicate IDs are present at any installation of AviaNZ. This name can be any string permitted by the OS, and no further information is gathered from it.  
`species` - string. This label will be assigned as the `species` in segments generated by this filter. Can follow `"genus (species)"` format as described above. Required.  
`SampleRate` - integer. All analyses will be done after down-(up-)sampling to this rate. Required.   
`method` - string, `"wv"` or `"chp"`. Empty defaults to `"wv"`.  
Any extra parameters to be applied for all subfilters may be provided (such as `"wind"`).  
`Filters` - JSON array of filters corresponding to each type of call (at least one element). Each is a JSON object:
    { "calltype": "clust1", "TimeRange": [min call length, max call length, avg syllable length, max gap between syllables], "WaveletParams": {"thr": 0.5, "M": 1.5, "nodes": [35, 37, 40]}, "FreqRange": [1000, 3000], ... }
    
`calltype` - either user-defined call type, or automatically generated cluster ID. String. Required.   
`TimeRange` - JSON array of length 4: `[minlen, maxlen, avgsyl, maxgap]`, respectively min and max lengths of a call, average syllable length, and maximum gap between parts of same call. Required.   
`WaveletParams` - JSON object of parameters needed for wavelet filtering. Required. Currently uses:  
* `thr` - numeric, threshold for detecting calls. Required.  
* `nodes` - JSON array of wavelet nodes used in this filter. Required.  
* `M` - numeric, energy curve window in seconds. Required for `method="wv"`.  
* `win` - numeric, window for energy averaging in seconds. Required for `method="chp"`.
`FreqRange` - frequency band for analysis. Identified calls will be marked as boxes with these limits, or as full-band segments if not provided.
Any extra subfilter parameters may follow, such as `"F0"`.
`PostResolution` - numeric. If present, detections will be merged and resplit into pieces of this many seconds (i.e. this parameter is both the merging gap and split piece length).
`NN` - JSON object. Meta information about the Convolution Neural Network (NN) model for this species:
    "NN": {"NN_name": "Kiwi (Nth Is Brown)", "loss": "binary_crossentropy", "optimizer": "adam", "win": 0.25, "inputdim": [128, 30], "output": {"0": "Male", "1": "Female", "2": "Noise"}}
If present, all the following are required:  
* `NN_name` - File name of the model, e.g. `Kiwi (Nth Is Brown).json` and `Kiwi (Nth Is Brown).h5` or `Kiwi (Nth Is Brown).weights.h5`.   
* `loss` - loss function.   
* `optimizer` - optimisation algorithm.   
* `win` - input image width in seconds.   
* `inputdim` - input dimension in pixels.   
* `output` - the output classes/labels.   
* `windowInc` - window width and increment.   
* `thr`- threshold for each call type.  
Thus, a full filter file may look like this:
    { "species": "Kiwi (Little spotted)", "SampleRate": 16000, "Rain": false, "Wind": true,
      "Filters": [
        { "calltype": "M", "TimeRange": [5, 60, 1, 3], "WaveletParams": {"nodes": [44, 45, 46], "thr": 0.5, "M": 1.5}, "F0": true, "FreqRange": [1500, 5000] },
        { "calltype": "F", "TimeRange": [10.0, 30.0, 0.8, 1.0], "WaveletParams": {"nodes": [41, 44], "thr": 0.8, "M": 2}, "FreqRange": [1000, 2500] }
      ],
      "NN": {"NN_name": "Kiwi (Little spotted)", "loss": "binary_crossentropy", "optimizer": "adam", "win": 0.25, "inputdim": [128, 30], "output": {"0": "M", "1": "F", "2": "Noise", "3": "Silence"}, "windowInc":[256, 128], "thr":[0.5, 0.3]}
    }
## NN files (.JSON/.h5/.hdf5)
A NN model has two files: model architecture is stored in a JSON file and the weights are stored in a Hierarchical Data Format 5 file (.h5 or .hdf5).
All the NN models are stored in the user configdir/Filters and referred in the corresponding Filter files.
## Correction files (.corrections/ .corrections_species)
All Species Review mode generates .corrections:
A JSON array where the first element stores metadata, and each remaining element corresponds to a segment changed by reviewer:
    [ Meta, [seg, newlabel], [seg, newlabel], [seg, newlabel] ... ]
`Meta`: a JSON object (key-value pairs) containing any metadata, same as in .data.
`seg`: Each segment seg is a JSON array containing five elements, same as in .data.
`newlabel`: New label/s assigned to the segment by the reviewer.
Single Species Review mode generates .corrections_species:
A JSON array where the first element stores metadata, and each remaining element corresponds to a segment deleted by reviewer:
    [ Meta, seg, seg, seg ... ]
`Meta`: a JSON object (key-value pairs) containing any metadata, same as in .data.
`seg`: Each segment seg is a JSON array containing five elements, same as in .data.

Replacement in CLAUDE.md at line 1 [4.363912]

B:BD[4.363912] → [4.363913:363951]

B:BD[4.363951] → [7.680:759]

# Claude's Notes - Skraak MCP Server
Essential reminders and best practices for the Skraak CLI/MCP Server codebase.

[4.363912]

[8.72538]

# Skraak CLI/MCP Server

Replacement in CLAUDE.md at line 9 [4.363912]
B:BD[7.1015] → [7.1015:1081]
```
- This file is expensive (loaded every session) - keep it concise
```
[7.1015]
[8.72789]
```
- **keep it concise**
```
Replacement in CLAUDE.md at line 23 [4.363912]
B:BD[8.74176] → [7.1177:1188]
∅:D[7.1188] → [4.364265:364273]
B:BD[4.364265] → [4.364265:364273]
B:BD[4.364273] → [7.1189:1249]
∅:D[7.1249] → [4.364660:364665]
B:BD[4.364660] → [4.364660:364665]
B:BD[4.364665] → [7.1250:1317]
```
**WRONG:**
```bash
./test_sql.sh  # Uses skraak.duckdb by default - DANGEROUS!
```
- `db/skraak.duckdb` = **PRODUCTION** (1.19M files, 139 locations)
```
[8.74176]
[7.1317]
```
- `db/skraak.duckdb` = **PRODUCTION** (1.4M files)
```

Deletion in CLAUDE.md at line 59 [4.363912]

B:BD[4.366528] → [7.2507:2720]

∅:D[7.2720] → [4.366723:366724]

B:BD[4.366723] → [4.366723:366724]

**Philosophy:** Schema + Generic SQL > Specialized Tools
- LLMs construct queries from schema (infinite flexibility)
- Full SQL expressiveness (JOINs, aggregates, CTEs)
- Prompts teach SQL patterns, not tool APIs

Deletion in CLAUDE.md at line 125 [4.363912]
∅:D[7.5449] → [9.27399:27400]
B:BD[4.368974] → [9.27399:27400]
B:BD[9.27400] → [7.5450:5454]
∅:D[7.5454] → [9.27433:27434]
B:BD[9.27433] → [9.27433:27434]
B:BD[9.27434] → [7.5455:5488]
∅:D[7.5488] → [9.27522:27523]
∅:D[10.20391] → [9.27522:27523]
B:BD[9.27522] → [9.27522:27523]
B:BD[9.27523] → [7.5489:5761]
∅:D[7.5761] → [4.368974:368975]
∅:D[9.27724] → [4.368974:368975]
B:BD[4.368974] → [4.368974:368975]
B:BD[4.368975] → [7.5762:5766]
```
---
## Security for execute_sql tool
- Database **read-only** (`db/db.go:27` appends `?access_mode=read_only`)
- Validation: Regex (SELECT/WITH only) + forbidden keywords
- Parameterized queries prevent SQL injection
- Application-level validation: ID format, numeric bounds, string lengths, entity existence
---
```
Deletion in CLAUDE.md at line 144 [4.363912]
∅:D[7.6251] → [4.369426:369427]
B:BD[4.369426] → [4.369426:369427]
B:BD[4.369427] → [7.6252:6268]
∅:D[7.6268] → [4.369976:369977]
B:BD[4.369976] → [4.369976:369977]
B:BD[4.370006] → [4.370006:370013]
B:BD[4.370013] → [7.6269:6348]
∅:D[7.6348] → [4.370114:370115]
B:BD[4.370114] → [4.370114:370115]
B:BD[4.370115] → [7.6349:6469]
∅:D[7.6469] → [4.370344:370345]
B:BD[4.370344] → [4.370344:370345]
B:BD[4.370345] → [7.6470:6492]
∅:D[7.6492] → [4.370387:370394]
B:BD[4.370387] → [4.370387:370394]
B:BD[4.370394] → [7.6493:6607]
∅:D[7.6607] → [4.370534:370699]
B:BD[4.370534] → [4.370534:370699]
B:BD[4.371431] → [4.371431:371448]
```
## SQL Examples
```sql
-- Basic query
SELECT id, name FROM dataset WHERE active = true ORDER BY name;
-- Parameterized (use execute_sql with parameters array)
SELECT * FROM location WHERE dataset_id = ? AND active = true;
-- JOINs + aggregates
SELECT
    d.name,
    COUNT(DISTINCT l.id) as locations,
    COUNT(DISTINCT c.id) as clusters,
    COUNT(f.id) as files
FROM dataset d
LEFT JOIN location l ON d.id = l.dataset_id
LEFT JOIN cluster c ON l.id = c.location_id
LEFT JOIN file f ON c.id = f.cluster_id
WHERE d.active = true
GROUP BY d.name;
```
Replacement in CLAUDE.md at line 145 [4.363912]
B:BD[4.371453] → [7.6608:6629]
∅:D[7.6629] → [4.371483:371610]
B:BD[4.371483] → [4.371483:371610]
B:BD[4.371610] → [7.6630:6723]
∅:D[7.6723] → [4.371705:371709]
B:BD[4.371705] → [4.371705:371709]
```
-- Temporal analysis
SELECT
    DATE_TRUNC('day', timestamp_local) as day,
    COUNT(*) as recordings,
    SUM(duration) as total_seconds
FROM file
WHERE active = true AND timestamp_local >= '2024-01-01'
GROUP BY day ORDER BY day LIMIT 100;
```
```
[4.371453]
[4.371709]
```
## SQL 
```

Replacement in CLAUDE.md at line 147 [4.363912]

B:BD[4.371710] → [7.6724:6888]

**Best practices:** Always `WHERE active = true`, use parameterized queries for IDs, use `LEFT JOIN` to include parent records, use `COUNT(DISTINCT)` when joining.

[4.371710]

[4.371783]

**Best practices:** `WHERE active = true`, use parameterized queries for IDs, use `LEFT JOIN` to include parent records, use `COUNT(DISTINCT)` when joining.

Deletion in CLAUDE.md at line 196 [4.363912]

B:BD[4.372371] → [7.8033:8123]

# Core functionality
./get_time.sh                                    # Time tool (no DB)

Deletion in CLAUDE.md at line 197 [4.363912]
∅:D[7.8182] → [4.372404:372405]
B:BD[4.372404] → [4.372404:372405]
B:BD[4.372405] → [7.8183:8247]
∅:D[7.8247] → [4.372629:372630]
B:BD[4.372629] → [4.372629:372630]
B:BD[4.372630] → [7.8248:8375]
∅:D[7.8375] → [4.372691:372692]
∅:D[8.76596] → [4.372691:372692]
B:BD[4.372691] → [4.372691:372692]
B:BD[4.372692] → [7.8376:8507]
```
# Write tools
./test_tools.sh ../db/test.duckdb > test.txt 2>&1
# Import tools
./test_import_file.sh ../db/test.duckdb > test.txt 2>&1
./test_bulk_import.sh ../db/test.duckdb > test.txt 2>&1
# Resources/prompts
./test_resources_prompts.sh ../db/test.duckdb | jq '.'
./test_all_prompts.sh ../db/test.duckdb > test.txt 2>&1
```
Deletion in CLAUDE.md at line 210 [4.363912]
B:BD[4.372985] → [4.372985:372986]
B:BD[4.372986] → [7.8885:8913]
```
170+ tests, 91.5% coverage.
```
Deletion in CLAUDE.md at line 220 [4.363912]
B:BD[11.25091] → [11.25091:25140]
```
- Includes tool name, SQL, parameters, timestamp
```

Replacement in CLAUDE.md at line 255 [4.363912]

B:BD[7.9080] → [7.9080:9234]

- **Database connection failed:** Check path exists and is readable
- **SQL syntax error:** Check query syntax, table/column names (use schema resources)

[7.9080]

[8.77810]

- **SQL syntax error:** use schema resources

Deletion in CLAUDE.md at line 275 [4.363912]

B:BD[12.29277] → [12.29277:29281]

B:BD[8.79786] → [8.79786:79787]

B:BD[8.79787] → [7.9327:9346]

∅:D[7.9346] → [13.70718:70719]

B:BD[13.70718] → [13.70718:70719]

B:BD[13.70719] → [3.44506:44555]

∅:D[11.25546] → [7.9397:9463]

∅:D[3.44555] → [7.9397:9463]

B:BD[7.9397] → [7.9397:9463]

B:BD[7.9463] → [14.731:815]

∅:D[14.815] → [15.423:504]

B:BD[7.9514] → [15.423:504]

∅:D[15.504] → [11.25643:25723]

∅:D[3.44662] → [11.25643:25723]

B:BD[11.25643] → [11.25643:25723]

B:BD[11.25723] → [3.44663:44796]

∅:D[11.25771] → [7.9648:9972]

∅:D[3.44796] → [7.9648:9972]

B:BD[7.9648] → [7.9648:9972]

∅:D[7.9972] → [13.70798:70799]

B:BD[13.70798] → [13.70798:70799]

B:BD[13.70799] → [3.44797:44831]

---
## Quick Reference
**Status:** Dataset export complete (2026-02-19)
**Architecture:** Two-layer (tools=MCP-free, cmd/mcp.go=adapters)
**Tools:** 10 MCP tools (read: 2, write: 4, import: 4) + 2 CLI-only import commands
**CLI Commands:** `mcp`, `sql`, `create`, `update`, `import`, `export`, `replay`
**Event Log:** SQL-level mutation capture for backup sync (`<db>.events.jsonl`)
**Dataset Export:** Full dataset export with FK traversal (`skraak export dataset`)
**Test Scripts:** 10 comprehensive shell scripts
**Test Coverage:** 170+ Go unit tests (91.5%)
**Import Logic:** Centralized in `utils/cluster_import.go` (553 lines)
**Timestamp Fallback:** AudioMoth → Filename → FileModTime
**Databases:** `skraak.duckdb` (production ⚠️), `test.duckdb` (testing ✅)
**Current Data:** 1.19M files, 139 locations, 8 active datasets
**Last Updated:** 2026-02-19 NZDT

Insertion in .ignore at line 16 [16.1]
[2.11353]
```
dasel
gum
```

deleted selection_metadata table and replaced it with label_metadata table

Dependencies

In channels

Change contents

Replacement in tools/export.go at line 55 [3.1]

Replacement in resources/schema.go at line 25 [4.114683]

Replacement in resources/schema.go at line 55 [4.114683]

Deletion in db/schema.sql at line 133 [4.305331]

Replacement in db/schema.sql at line 188 [4.305331]

Insertion in db/schema.sql at line 196 [4.305331]

File addition: avianz_file_format_specification.md (----------)

Replacement in CLAUDE.md at line 1 [4.363912]

Replacement in CLAUDE.md at line 9 [4.363912]

Replacement in CLAUDE.md at line 23 [4.363912]

Deletion in CLAUDE.md at line 59 [4.363912]

Deletion in CLAUDE.md at line 125 [4.363912]

Deletion in CLAUDE.md at line 144 [4.363912]

Replacement in CLAUDE.md at line 145 [4.363912]

Replacement in CLAUDE.md at line 147 [4.363912]

Deletion in CLAUDE.md at line 196 [4.363912]

Deletion in CLAUDE.md at line 197 [4.363912]

Deletion in CLAUDE.md at line 210 [4.363912]

Deletion in CLAUDE.md at line 220 [4.363912]

Replacement in CLAUDE.md at line 255 [4.363912]

Deletion in CLAUDE.md at line 275 [4.363912]

Insertion in .ignore at line 16 [16.1]