OpenFARM Bioacoustics RL Env (Openfarm)

OpenFARM bioacoustic classification suite for animal vocalization benchmarks

  • Type: RL Env
  • Publisher: Openfarm
  • Runtime: single-turn
  • License: unknown
  • Size: v0.1.0
  • Published: May 2026

openfarm-bioacustics


Overview

OpenFARM Bioacoustics is a task-registry environment for animal vocalization benchmarks. It keeps each source dataset/task explicit, while exposing a single SoundWel-like interface for audio, spectrogram, multimodal, and text/acoustic metadata ablations.

  • Environment ID: openfarm-bioacustics
  • Type: single-turn classification; becomes an EnvGroup when multiple tasks are selected
  • Modalities: audio, vision, multimodal, text
  • Output format: XML answer, with optional explanation
  • Primary metric: exact normalized answer reward

Vision mode uses a source spectrogram column when one exists. For audio-only datasets, the environment generates a compact spectrogram from the audio at load time.
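
For readers who want a feel for what generating a compact spectrogram from audio involves, here is a minimal NumPy sketch. The function name, the 512-sample Hann window, the 128-sample hop, and the -80 dB floor are all assumptions chosen for illustration; the environment's own spectrogram code may use different settings.

```python
import numpy as np


def compact_spectrogram(samples, n_fft=512, hop=128):
    """Return a log-magnitude spectrogram of shape (n_fft // 2 + 1, n_frames).

    Hypothetical helper: window size, hop, and dB floor are illustrative only.
    """
    samples = np.asarray(samples, dtype=np.float64)
    # Zero-pad very short clips so that at least one full frame exists.
    if samples.size < n_fft:
        samples = np.pad(samples, (0, n_fft - samples.size))
    n_frames = 1 + (samples.size - n_fft) // hop
    window = np.hanning(n_fft)
    frames = np.stack(
        [samples[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=1)).T  # (freq_bins, frames)
    # Decibels relative to the loudest bin, floored at -80 dB.
    peak = max(magnitude.max(), 1e-10)
    db = 20.0 * np.log10(np.maximum(magnitude, 1e-10) / peak)
    return np.clip(db, -80.0, 0.0)


# One second of a 1 kHz tone at 16 kHz stands in for a real recording.
sample_rate = 16_000
t = np.arange(sample_rate) / sample_rate
spec = compact_spectrogram(np.sin(2 * np.pi * 1000 * t))
print(spec.shape)
```

The resulting array can be rendered to an image with any plotting library before being passed to a vision model.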

Tasks

| Task | Dataset | Label |
| --- | --- | --- |
| soundwel_valence | oliveirabruno01/soundwel-pig-vocalizations | Positive / Negative |
| soundwel_context | oliveirabruno01/soundwel-pig-vocalizations | pig vocalization context |
| soundwel_call_type | oliveirabruno01/soundwel-pig-vocalizations | HF / LF |
| soundwel_age | oliveirabruno01/soundwel-pig-vocalizations | pig age category |
| catmeows_context | oliveirabruno01/openfarm-catmeows | feline source context |
| laying_hen_stress_context | oliveirabruno01/openfarm-laying-hen-stress | experimental stress context |
| laying_hen_stress_binary | oliveirabruno01/openfarm-laying-hen-stress | stress-response binary |
| ungulate_valence | oliveirabruno01/openfarm-ungulate-valence | Positive / Negative |

Source Attribution

| Dataset | Prepared HF Dataset | Source Dataset / Paper |
| --- | --- | --- |
| SoundWel pig vocalizations | oliveirabruno01/soundwel-pig-vocalizations | The Soundwel Database: a labeled pig vocalization repository, Zenodo record 8252482, DOI 10.1038/s41598-022-07174-8, CC-BY-4.0 |
| CatMeows | oliveirabruno01/openfarm-catmeows | CatMeows: A Publicly-Available Dataset of Cat Vocalizations, Zenodo record 4008297, DOI 10.5281/zenodo.4008297, CC-BY-4.0 |
| Laying hen stress vocalizations | oliveirabruno01/openfarm-laying-hen-stress | Vocalization Patterns in Laying Hens - An Analysis of Stress-Induced Audio Responses, Zenodo record 10433023, DOI 10.5281/zenodo.10433023, CC-BY-4.0 |
| Ungulate vocalization valence | oliveirabruno01/openfarm-ungulate-valence | Machine Learning Algorithms Can Predict Emotional Valence Across Ungulate Vocalizations, Zenodo record 14636641, DOI 10.5281/zenodo.14636641, CC-BY-4.0 |

Quickstart

prime eval run openfarm-bioacustics \
  -a '{"task": "soundwel_valence", "modality": "audio", "max_examples_per_task": 20}'

Run several tasks as one EnvGroup:

prime eval run openfarm-bioacustics \
  -a '{"task": ["soundwel_valence", "catmeows_context"], "modality": "multimodal", "max_examples_per_task": 20}'

Use generated spectrograms for an audio-only dataset:

prime eval run openfarm-bioacustics \
  -a '{"task": "catmeows_context", "modality": "vision", "max_examples_per_task": 20}'

Run the ungulate valence benchmark on its balanced, non-pig animal-heldout test split:

prime eval run openfarm-bioacustics \
  -a '{"task": "ungulate_valence", "modality": "audio", "max_examples_per_task": 20}'
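
When sweeping over tasks or modalities, hand-writing the JSON payload gets error-prone. The sketch below builds the same kind of command from a Python dict. It relies only on the `prime eval run <env> -a <json>` form shown in the commands above; the variable names are illustrative.

```python
import json
import shlex

# Argument names and values mirror the Quickstart examples.
env_args = {
    "task": ["soundwel_valence", "catmeows_context"],
    "modality": "multimodal",
    "max_examples_per_task": 20,
}

# json.dumps guarantees that the -a payload is valid JSON.
argv = ["prime", "eval", "run", "openfarm-bioacustics", "-a", json.dumps(env_args)]

# shlex.join quotes the payload so the line can be pasted into a POSIX shell.
print(shlex.join(argv))

# To launch from Python instead of printing:
#   import subprocess; subprocess.run(argv, check=True)
```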

Environment Arguments

| Arg | Type | Default | Description |
| --- | --- | --- | --- |
| task | str/list | "soundwel_valence" | Task name, list of task names, or "all". |
| modality | str | "audio" | audio, vision, multimodal, or text. |
| include_tabular_data | bool | false | Adds task-specific leakage-safe metadata. Forced on for text. |
| use_escape_hatch | bool | false | Adds UNINTELLIGIBLE as an allowed answer. |
| max_examples_per_task | int | -1 | Optional per-task subsampling budget. |
| balancing_strategy | str | "proportional" | proportional or balanced when subsampling. |
| target_pad_seconds | float | 3.592 | Center-pads short audio before encoding. |
| max_audio_seconds | float/null | null | Clips long audio before encoding or spectrogram generation. |
| audio_clip_strategy | str | "start" | start, center, or even_windows for clipping long recordings. |
| require_explanation | bool | false | Requires an <explanation> field before <answer>. |
| format_reward_weight | float | 0.0 | Optional XML format reward weight. |
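
To make the difference between the two `balancing_strategy` values concrete, here is a small sketch of per-task subsampling. It is one plausible reading, not the environment's code: the `subsample` helper is hypothetical, it treats a negative budget as "keep everything" (an assumption about the `-1` default), and it hands rounding leftovers to classes in sorted order.

```python
import random
from collections import defaultdict


def subsample(labels, budget, strategy="proportional", seed=0):
    """Return sorted indices of at most `budget` examples (hypothetical helper)."""
    n = len(labels)
    if budget < 0 or budget >= n:
        return list(range(n))  # assumption: a negative budget disables subsampling
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for index, label in enumerate(labels):
        by_label[label].append(index)
    classes = sorted(by_label)
    if strategy == "proportional":
        # Each class keeps roughly its original share of the data.
        quotas = {c: budget * len(by_label[c]) // n for c in classes}
    elif strategy == "balanced":
        # Each class gets an equal share, capped by how many examples it has.
        quotas = {c: min(budget // len(classes), len(by_label[c])) for c in classes}
    else:
        raise ValueError(f"unknown balancing strategy: {strategy!r}")
    # Hand out slots lost to rounding or caps to classes with spare examples.
    leftover = budget - sum(quotas.values())
    while leftover > 0:
        for c in classes:
            if leftover > 0 and quotas[c] < len(by_label[c]):
                quotas[c] += 1
                leftover -= 1
    picked = []
    for c in classes:
        picked.extend(rng.sample(by_label[c], quotas[c]))
    return sorted(picked)


labels = ["Positive"] * 80 + ["Negative"] * 20
proportional = subsample(labels, 20, "proportional")
balanced = subsample(labels, 20, "balanced")
print(sum(labels[i] == "Negative" for i in proportional))
print(sum(labels[i] == "Negative" for i in balanced))
```

With an 80/20 class split and a budget of 20, the proportional draw keeps the skew while the balanced draw evens it out, which is why the choice matters for small evaluation budgets.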

Dataset Notes

Laying Hen Stress

  • Public audio artifacts are bounded 15-second mono 16 kHz excerpts.
  • Excerpts use even_windows over the longer Zenodo recordings because the source data does not provide stress-event timestamps.
  • The task still defaults to max_audio_seconds=15.0 and audio_clip_strategy="even_windows" as a safety net for older dataset revisions or local experiments pointed at full source recordings.
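
The padding and clipping arguments, including the `even_windows` default used for this task, can be pictured with the sketch below. Both helpers are hypothetical and operate on plain Python lists to stay self-contained. In particular, the interpretation of `even_windows` as "concatenate a few equal-width slices spread evenly across the recording" and the `n_windows` parameter are assumptions; the environment may split recordings differently.

```python
def center_pad(samples, sample_rate, target_seconds=3.592):
    """Zero-pad on both sides until the clip lasts target_seconds."""
    target = round(target_seconds * sample_rate)
    missing = target - len(samples)
    if missing <= 0:
        return list(samples)
    left = missing // 2
    return [0.0] * left + list(samples) + [0.0] * (missing - left)


def clip_audio(samples, sample_rate, max_seconds, strategy="start", n_windows=3):
    """Shorten a recording to at most max_seconds (hypothetical helper)."""
    limit = round(max_seconds * sample_rate)
    if len(samples) <= limit:
        return list(samples)
    if strategy == "start":
        return list(samples[:limit])
    if strategy == "center":
        offset = (len(samples) - limit) // 2
        return list(samples[offset : offset + limit])
    if strategy == "even_windows":
        # Assumed behaviour: n_windows equal slices with evenly spaced starts.
        width = limit // n_windows
        span = len(samples) - width
        starts = [round(k * span / max(n_windows - 1, 1)) for k in range(n_windows)]
        return [x for start in starts for x in samples[start : start + width]]
    raise ValueError(f"unknown clip strategy: {strategy!r}")


# A 100-sample "recording" at 1 Hz keeps the arithmetic easy to follow.
recording = list(range(100))
print(clip_audio(recording, 1, 30, "start")[:3])
print(clip_audio(recording, 1, 30, "center")[:3])
print(clip_audio(recording, 1, 30, "even_windows"))
print(center_pad([1.0, 2.0], 1, 5))
```

Sampling several windows is a reasonable fallback when, as here, the source gives no timestamps for where the event of interest occurs.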

Ungulate Valence

  • Headline evals should use the cleaned, balanced train/test split pair.
  • train_raw and test_raw preserve the natural non-pig class distribution for explicit diagnostics.
  • pig_family_heldout isolates domestic pig and wild boar rows for optional transfer checks against SoundWel/PVWB-adjacent work.
  • Default tabular prompts use only acoustic features, because context, source reference, and animal ID can be predictive of the valence label.
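
The leakage concern in the last bullet can be illustrated with an allow-list: only fields explicitly marked as acoustic reach the prompt, so anything else in the row is dropped by default. Every column name and value below is hypothetical and does not reflect the real dataset schema.

```python
def tabular_prompt(row, acoustic_fields):
    """Render only allow-listed acoustic fields as 'name: value' lines."""
    lines = [f"{name}: {row[name]}" for name in acoustic_fields if name in row]
    return "\n".join(lines)


# Hypothetical row: field names and values are invented for illustration.
row = {
    "duration_s": 0.42,
    "mean_f0_hz": 310.5,
    "context": "example-context",  # could reveal the label, so never rendered
    "animal_id": "A17",            # could reveal the label, so never rendered
    "valence": "Negative",         # the label itself
}

print(tabular_prompt(row, ["duration_s", "mean_f0_hz"]))
```

An allow-list fails safe: a column added to the dataset later stays out of the prompt until someone decides it is not predictive of the label.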

Metrics

The live rubric is intentionally small:

| Metric | Meaning |
| --- | --- |
| accuracy_reward | 1.0 when the parsed answer matches the task label after normalization. |
| format_reward | Optional XML-format reward when format_reward_weight > 0. |
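
As a rough picture of how the two rubric entries above could be computed, the sketch below parses the `<answer>` tag, compares it to the label after normalization, and optionally adds a weighted format term. The normalization rules (lowercasing, collapsing whitespace, underscores, and hyphens), the strictness of the format check, and the weighted-sum combination are assumptions, not the environment's actual rubric.

```python
import re

ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL | re.IGNORECASE)


def normalize(text):
    """Lowercase, trim, and collapse whitespace/underscores/hyphens (assumed rules)."""
    return re.sub(r"[\s_\-]+", " ", text.strip().lower())


def parse_answer(completion):
    """Return the content of the last <answer> tag, or None if there is none."""
    matches = ANSWER_RE.findall(completion)
    return matches[-1] if matches else None


def accuracy_reward(completion, label):
    answer = parse_answer(completion)
    if answer is None:
        return 0.0
    return 1.0 if normalize(answer) == normalize(label) else 0.0


def format_reward(completion, require_explanation=False):
    """1.0 when the completion consists only of the expected XML fields."""
    pattern = r"\s*<answer>.*?</answer>\s*"
    if require_explanation:
        pattern = r"\s*<explanation>.*?</explanation>" + pattern
    return 1.0 if re.fullmatch(pattern, completion, re.DOTALL) else 0.0


def total_reward(completion, label, format_reward_weight=0.0, require_explanation=False):
    # Assumed combination: accuracy plus a weighted format bonus.
    bonus = format_reward_weight * format_reward(completion, require_explanation)
    return accuracy_reward(completion, label) + bonus


completion = "<explanation>low grunt</explanation><answer> Negative </answer>"
print(accuracy_reward(completion, "negative"))
print(format_reward(completion, require_explanation=True))
```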

Use post-eval reporting for macro accuracy, balanced accuracy, macro F1, and per-class recall.
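
The post-eval numbers named above can be computed from saved labels and parsed predictions with a few lines of standard-library Python. This is a sketch: the `classification_report` helper is hypothetical, and it reports the mean of per-class recall under the single name `balanced_accuracy`, which is the usual definition and is also what some sources call macro accuracy.

```python
from collections import Counter


def classification_report(y_true, y_pred):
    """Compute accuracy, balanced accuracy, macro F1, and per-class recall."""
    classes = sorted(set(y_true))
    true_positive = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    support = Counter(y_true)    # true examples per class
    predicted = Counter(y_pred)  # predictions per class
    recall = {c: true_positive[c] / support[c] for c in classes}
    f1 = {}
    for c in classes:
        # F1 = 2TP / (2TP + FP + FN) = 2TP / (support + predicted)
        denominator = support[c] + predicted[c]
        f1[c] = 2 * true_positive[c] / denominator if denominator else 0.0
    return {
        "accuracy": sum(true_positive.values()) / len(y_true),
        "balanced_accuracy": sum(recall.values()) / len(classes),
        "macro_f1": sum(f1.values()) / len(classes),
        "per_class_recall": recall,
    }


y_true = ["Positive", "Positive", "Positive", "Negative"]
y_pred = ["Positive", "Positive", "Negative", "Negative"]
report = classification_report(y_true, y_pred)
print(report)
```

Predictions that fall outside the label set, such as an unparsed answer stored as `None`, simply count as errors for the true class.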