Skip to content

sadda.corpus

The corpus surface — projects, bundles, tiers, annotations. STABLE tier: won't break across minor versions.

Project loaders live at the top of the sadda package:

new_project builtin

new_project(path, name)

Creates a new sadda project at path (which must not already exist). Returns a sadda.Project handle ready for .add_speaker(...) / .add_session(...) / .add_bundle(...) calls.

open_project builtin

open_project(path)

Opens an existing sadda project at path. Applies any pending schema migrations first, writing a corpus.db.bak.<old_version> backup.

Project

Project

A sadda project: a directory holding audio, derived signals, attachments, and a SQLite-backed corpus database. Construct via sadda.new_project(...) or sadda.open_project(...).

root property

root

Project's filesystem root.

name property

name

Project's human-readable name (from the singleton project row).

audit_user property

audit_user

User string written into audit_log.user for mutations on this connection. Defaults to "local".

add_bundle method descriptor

add_bundle(name, source_audio_path, *, session_id=None, speaker_id=None, extra=None)

Registers a bundle by copying source_audio_path into the project's signals/original/ directory and recording its metadata in the corpus database. Returns the new bundle's id. Optional kwargs attach the bundle to a Session / Speaker and set a JSON extra payload.

add_bundle_split method descriptor

add_bundle_split(name_prefix, source_audio_path, chunk_seconds)

Splits a (typically very long) WAV into contiguous chunks of about chunk_seconds each, writing every chunk into the project as its own bundle named "<name_prefix>_NNN". The source is streamed, so memory stays flat regardless of length — this is how a file too large to load whole still gets in. Chunk audio preserves the source format; the final chunk holds the remainder. Returns the new bundle ids in order.

bundles method descriptor

bundles()

Lists all bundles in id order.

rename_bundle method descriptor

rename_bundle(bundle_id, new_name)

Renames a bundle's display name. The underlying WAV file is left untouched. Raises if bundle_id does not exist or the new name is empty / whitespace-only.

delete_bundle method descriptor

delete_bundle(bundle_id)

Permanently deletes a bundle and all its tiers, annotations, derived signals, and processing-run audit rows. Best-effort removes the underlying WAV from disk. No-op if bundle_id does not exist.

load_audio method descriptor

load_audio(bundle_id)

Loads the audio file for a bundle.

add_speaker method descriptor

add_speaker(name, *, sex=None, birth_year=None, notes=None, extra=None)

Inserts a Speaker row. Returns the new speaker's id.

speakers method descriptor

speakers()

Lists all speakers in id order.

get_speaker method descriptor

get_speaker(id)

Fetches a single speaker by id.

add_session method descriptor

add_session(name, *, started_at=None, ended_at=None, location=None, instrument_id=None, protocol_id=None, notes=None, extra=None)

Inserts a Session row. Returns the new session's id.

sessions method descriptor

sessions()

Lists all sessions in id order.

get_session method descriptor

get_session(id)

Fetches a single session by id.

add_instrument method descriptor

add_instrument(name, *, kind=None, serial=None, calibration=None, extra=None)

Inserts an instrument (microphone / interface), optionally with a Calibration. Returns the new instrument's id.

instruments method descriptor

instruments()

Lists all instruments in id order.

get_instrument method descriptor

get_instrument(instrument_id)

Fetches a single instrument by id.

bundle_calibration method descriptor

bundle_calibration(bundle_id)

Resolves a bundle's calibration via bundle → session → instrument. None means levels for that bundle are dB-FS only.

add_tier method descriptor

add_tier(bundle_id, name, type, *, parent_id=None, cardinality=None, schema=None, extra=None)

Inserts a Tier row. type is one of interval, point, reference, continuous_numeric, continuous_vector, categorical_sampled. Returns the new tier's id.

tiers method descriptor

tiers(bundle_id=None)

Lists tiers, optionally restricted to a single bundle.

get_tier method descriptor

get_tier(id)

Fetches a single tier by id.

rename_tier method descriptor

rename_tier(tier_id, new_name)

Renames a tier's display name. Raises if tier_id does not exist or the new name is empty / whitespace-only.

delete_tier method descriptor

delete_tier(tier_id)

Deletes a tier and all of its annotations (intervals / points / references) and any dense derived-signal sidecar. Raises if the tier has child tiers (delete those first) or if tier_id does not exist.

add_interval method descriptor

add_interval(tier_id, start_seconds, end_seconds, *, label=None, parent_annotation_id=None, status=None, note=None, extra=None)

Inserts an interval annotation. Enforces parent-child cardinality at insert time; raises ValueError on cardinality violation.

intervals method descriptor

intervals(tier_id)

Lists intervals for a tier in (start_seconds, id) order.

add_point method descriptor

add_point(tier_id, time_seconds, *, label=None, parent_annotation_id=None, status=None, note=None, extra=None)

Inserts a point annotation. Enforces parent-child cardinality.

points method descriptor

points(tier_id)

Lists points for a tier in (time_seconds, id) order.

add_reference method descriptor

add_reference(tier_id, target_kind, target_id, *, label=None, parent_annotation_id=None, extra=None)

Inserts a reference annotation pointing at another row via (target_kind, target_id).

references_for method descriptor

references_for(tier_id)

Lists references for a tier in id order. Named references_for rather than references (which collides with Rust's ref family of grep targets).

query

query(tier_id)

Returns the rows of a sparse tier as a polars.DataFrame. Columns depend on the tier's type:

  • interval: id, tier_id, start_seconds, end_seconds, duration_seconds, label, parent_annotation_id, extra
  • point: id, tier_id, time_seconds, label, parent_annotation_id, extra
  • reference: id, tier_id, target_kind, target_id, label, parent_annotation_id, extra

Dense tiers (continuous_numeric / continuous_vector / categorical_sampled) live in Parquet sidecars and arrive in B3.

write_continuous_numeric method descriptor

write_continuous_numeric(tier_id, samples, sample_rate_hz)

Writes a continuous_numeric Parquet sidecar from a 1-D float64 NumPy array and inserts the matching DerivedSignal row. Returns the new DerivedSignal id. Errors with TypeError-style messages if the tier isn't continuous_numeric or already has a sidecar.

write_continuous_vector method descriptor

write_continuous_vector(tier_id, frames, sample_rate_hz)

Writes a continuous_vector Parquet sidecar from a 2-D float64 NumPy array of shape [n_frames, n_dims].

write_categorical_sampled method descriptor

write_categorical_sampled(tier_id, labels, sample_rate_hz)

Writes a categorical_sampled Parquet sidecar from a list of strings.

read_continuous_numeric method descriptor

read_continuous_numeric(tier_id)

Reads a continuous_numeric sidecar back into a 1-D float64 NumPy array.

read_continuous_vector method descriptor

read_continuous_vector(tier_id)

Reads a continuous_vector sidecar back into a 2-D float64 NumPy array.

read_categorical_sampled method descriptor

read_categorical_sampled(tier_id)

Reads a categorical_sampled sidecar back into a list of strings.

dense_path method descriptor

dense_path(tier_id)

Returns the absolute filesystem path of a dense tier's Parquet sidecar (as a string), or None if no sidecar has been written yet. Use with polars.scan_parquet(path) for zero-engine-API reads.

derived_signal method descriptor

derived_signal(tier_id)

Returns the DerivedSignal row for a tier, or None if no sidecar has been written yet.

import_textgrid method descriptor

import_textgrid(path, bundle_id)

Imports a Praat TextGrid into bundle_id. Each Praat tier becomes a new Tier row (interval or point); each annotation becomes an annotation_interval / annotation_point row. JSON sentinels in labels are decoded back into the extra field. Returns the list of new tier IDs in import order. Records a processing_run row for audit provenance.

export_textgrid method descriptor

export_textgrid(bundle_id, path, *, tier_ids=None)

Writes a Praat TextGrid for bundle_id's sparse tiers to path. If tier_ids is given, only those tiers are exported. Dense tiers (continuous_numeric / vector / categorical_sampled) are skipped. Reference tiers are exported as IntervalTiers with a degenerate [0.0, 0.001] time span plus a JSON sentinel carrying their (target_kind, target_id).

import_eaf method descriptor

import_eaf(path, bundle_id)

Imports an ELAN .eaf into bundle_id. Tier hierarchy is preserved via EAF's PARENT_REFtier.parent_id. Point tiers are recovered from degenerate [t, t+1ms] alignable annotations via a ≤2ms heuristic. Reference tiers (Symbolic_Association linguistic type) come back as reference tiers. Returns the new tier IDs in import order (parents first per topological sort). Records a processing_run row for audit provenance.

export_eaf method descriptor

export_eaf(bundle_id, path, *, tier_ids=None)

Writes an ELAN .eaf (EAF 2.8) for bundle_id's sparse tiers to path. If tier_ids is given, only those tiers are exported. Dense tiers (continuous_numeric / vector / categorical_sampled) are skipped. Interval tiers with parents use the Included_In stereotype; reference tiers become REF_ANNOTATION tiers with the Symbolic_Association stereotype + a JSON sentinel encoding (target_kind, target_id).

record_processing_run method descriptor

record_processing_run(bundle_id, kind, processor_id, *, parameters=None, input_tier_ids=None, output_tier_ids=None, output_signal_ids=None, weights_checksum=None)

Records a completed processing run for audit provenance and returns its id. The engine fills in the sadda version, timestamps, and active recipe id. kind is one of dsp_algorithm, ml_model, clinical_measure, plugin, live_recording.

processing_runs method descriptor

processing_runs(bundle_id)

Returns a bundle's processing-run timeline (provenance), oldest first.

citations method descriptor

citations(bundle_id)

Returns the literature citations for the analyses a bundle used, deduplicated by processor and ordered by first use. Uncited processors (imports, recording) are omitted.

extract_embeddings method descriptor

extract_embeddings(bundle_id, model_id, tier_name)

E12: resolve model_id (sadda/… / local://… / hf://…), run it as an embedding extractor over bundle_id's audio, and store the result as a new continuous_vector tier tier_name, recording an ml_model processing run. Returns the new tier id. (Provisional.)

pin_refdist method descriptor

pin_refdist(id, version)

Pins a reference distribution id to a specific version in project.toml, so the project reopens against the same data for reproducibility (C7). Overwrites any existing pin for that id.

refdist_pins method descriptor

refdist_pins()

The reference distributions this project has pinned, as a {id: version} dict.

remove_refdist_pin method descriptor

remove_refdist_pin(id)

Removes a reference-distribution pin; returns whether one existed.

set_audit_user method descriptor

set_audit_user(user)

Sets the user string written into audit_log.user for subsequent mutations on this connection.

Data types

Audio

Audio data loaded from disk. Samples are interleaved float32 in [-1.0, 1.0]; for stereo the layout is [L0, R0, L1, R1, ...]. Construct via sadda.load_wav(path).

__doc__ class-attribute

__doc__ = 'Audio data loaded from disk. Samples are interleaved float32 in `[-1.0, 1.0]`;\nfor stereo the layout is `[L0, R0, L1, R1, ...]`. Construct via\n`sadda.load_wav(path)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

channels property

channels

Number of audio channels (1 = mono, 2 = stereo, …).

duration_seconds property

duration_seconds

Audio duration in seconds.

n_frames property

n_frames

Number of audio frames (samples per channel).

sample_rate property

sample_rate

Sample rate in Hz.

samples property

samples

Interleaved samples as a 1-D float32 NumPy array. For stereo the layout is [L0, R0, L1, R1, ...].

__repr__ method descriptor

__repr__()

Return repr(self).

mono method descriptor

mono()

Mono mixdown of the audio as a 1-D float32 NumPy array.

AudioProbe

AudioProbe(*args, **kwargs)

Header-only summary of a WAV file (see sadda.probe_wav): its size without the cost of decoding. Lets a caller gauge a file's in-memory footprint before loading it.

Initialize self. See help(type(self)) for accurate signature.

__doc__ class-attribute

__doc__ = "Header-only summary of a WAV file (see `sadda.probe_wav`): its size without\nthe cost of decoding. Lets a caller gauge a file's in-memory footprint\nbefore loading it."

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'provisional'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

channels property

channels

Number of channels.

decoded_bytes property

decoded_bytes

Bytes a full decode would occupy (interleaved f32): the RAM cost of loading this file whole.

duration_seconds property

duration_seconds

Duration in seconds.

n_frames property

n_frames

Number of frames (samples per channel).

sample_rate property

sample_rate

Sample rate in Hz.

__repr__ method descriptor

__repr__()

Return repr(self).

Bundle

One recording inside a [Project]: audio header plus optional Session + Speaker FKs and a freeform JSON extra payload. Read-only view; mutate via Project.add_bundle(...).

__doc__ class-attribute

__doc__ = 'One recording inside a [`Project`]: audio header plus optional Session +\nSpeaker FKs and a freeform JSON `extra` payload. Read-only view; mutate\nvia `Project.add_bundle(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

audio_relative_path property

audio_relative_path

Audio file path relative to the project root.

channels property

channels

Number of audio channels.

extra property

extra

Freeform JSON payload (stored as text).

id property

id

Bundle id (primary key).

n_frames property

n_frames

Number of audio frames (samples per channel).

name property

name

Human-readable bundle name.

sample_rate property

sample_rate

Audio sample rate in Hz.

session_id property

session_id

Optional Session id this bundle belongs to.

speaker_id property

speaker_id

Optional Speaker id this bundle recorded.

__repr__ method descriptor

__repr__()

Return repr(self).

Tier

One annotation tier: the header row in tier. Annotation rows belonging to it live in annotation_interval / annotation_point / annotation_reference (for the three sparse types) or a Parquet sidecar (the three dense types, landing in B3). Read-only view; create via Project.add_tier(...).

__doc__ class-attribute

__doc__ = 'One annotation tier: the header row in `tier`. Annotation rows\nbelonging to it live in `annotation_interval` / `annotation_point` /\n`annotation_reference` (for the three sparse types) or a Parquet sidecar\n(the three dense types, landing in B3). Read-only view; create via\n`Project.add_tier(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

bundle_id property

bundle_id

Bundle this tier belongs to.

cardinality property

cardinality

Parent-child cardinality (one_to_one | one_to_many | many_to_one | none).

created_at property

created_at

ISO 8601 UTC creation timestamp.

extra property

extra

JSON extra payload.

id property

id

Tier id (primary key).

name property

name

Human-readable tier name (unique within a bundle).

parent_id property

parent_id

Optional parent tier id.

schema property

schema

JSON schema payload.

type property

type

Tier type: one of interval, point, reference, continuous_numeric, continuous_vector, categorical_sampled.

__repr__ method descriptor

__repr__()

Return repr(self).

Interval

One interval annotation. Read-only view; create via Project.add_interval(...).

__doc__ class-attribute

__doc__ = 'One interval annotation. Read-only view; create via\n`Project.add_interval(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

duration_seconds property

duration_seconds

Duration in seconds (end_seconds - start_seconds).

end_seconds property

end_seconds

End time in seconds.

extra property

extra

JSON extra payload.

id property

id

Annotation id.

label property

label

Label string.

note property

note

Free-text note, or None.

parent_annotation_id property

parent_annotation_id

Parent annotation id in the parent tier.

processing_run_id property

processing_run_id

Provenance link to the producing ProcessingRun (e.g. a criterion run), or None for a hand-made annotation.

start_seconds property

start_seconds

Start time in seconds.

status property

status

Annotation status (a rubric-defined status string), or None.

tier_id property

tier_id

Tier this interval belongs to.

__repr__ method descriptor

__repr__()

Return repr(self).

Point

One point annotation. Read-only view; create via Project.add_point(...).

__doc__ class-attribute

__doc__ = 'One point annotation. Read-only view; create via\n`Project.add_point(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

extra property

extra

JSON extra payload.

id property

id

Annotation id.

label property

label

Label string.

note property

note

Free-text note, or None.

parent_annotation_id property

parent_annotation_id

Parent annotation id.

processing_run_id property

processing_run_id

Provenance link to the producing ProcessingRun (e.g. a criterion run), or None for a hand-made annotation.

status property

status

Annotation status (a rubric-defined status string), or None.

tier_id property

tier_id

Tier this point belongs to.

time_seconds property

time_seconds

Time in seconds.

__repr__ method descriptor

__repr__()

Return repr(self).

Reference

One reference annotation. Read-only view; create via Project.add_reference(...).

__doc__ class-attribute

__doc__ = 'One reference annotation. Read-only view; create via\n`Project.add_reference(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

extra property

extra

JSON extra payload.

id property

id

Annotation id.

label property

label

Label string.

parent_annotation_id property

parent_annotation_id

Parent annotation id.

target_id property

target_id

Target row id.

target_kind property

target_kind

Target kind: bundle | session | speaker | tier | annotation.

tier_id property

tier_id

Tier this reference belongs to.

__repr__ method descriptor

__repr__()

Return repr(self).

DerivedSignal

Registration row for a Parquet sidecar holding a dense tier's data. Created automatically by the Project.write_continuous_numeric / write_continuous_vector / write_categorical_sampled methods.

__doc__ class-attribute

__doc__ = "Registration row for a Parquet sidecar holding a dense tier's data.\nCreated automatically by the `Project.write_continuous_numeric` /\n`write_continuous_vector` / `write_categorical_sampled` methods."

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

created_at property

created_at

ISO 8601 UTC creation timestamp.

dtype property

dtype

Dtype label: f64, f32, utf8.

extra property

extra

Freeform JSON payload.

id property

id

DerivedSignal id (primary key).

n_dims property

n_dims

Number of dimensions per frame.

n_frames property

n_frames

Number of frames in the sidecar.

relative_path property

relative_path

Path to the Parquet sidecar, relative to the project root.

sample_rate_hz property

sample_rate_hz

Sample rate in Hz; None for non-sampled / variable-rate signals.

tier_id property

tier_id

Tier this sidecar belongs to.

__repr__ method descriptor

__repr__()

Return repr(self).

Speaker

A person who produced speech in the project (participant, patient, case subject, …). Read-only view; create via Project.add_speaker(...).

__doc__ class-attribute

__doc__ = 'A person who produced speech in the project (participant, patient, case\nsubject, …). Read-only view; create via `Project.add_speaker(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

birth_year property

birth_year

Birth year (full integer year).

created_at property

created_at

ISO 8601 UTC creation timestamp.

extra property

extra

Freeform JSON payload.

id property

id

Speaker id (primary key).

name property

name

Human-readable name or pseudonymous identifier.

notes property

notes

Freeform notes.

sex property

sex

Sex / gender label (free text).

__repr__ method descriptor

__repr__()

Return repr(self).

Session

A recording session — a time-bounded block during which one or more bundles were captured. Read-only view; create via Project.add_session(...).

__doc__ class-attribute

__doc__ = 'A recording session — a time-bounded block during which one or more\nbundles were captured. Read-only view; create via `Project.add_session(...)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

created_at property

created_at

ISO 8601 UTC creation timestamp.

ended_at property

ended_at

ISO 8601 UTC end timestamp.

extra property

extra

Freeform JSON payload.

id property

id

Session id (primary key).

instrument_id property

instrument_id

FK into the instrument table.

location property

location

Free-form location label.

name property

name

Human-readable session name.

notes property

notes

Freeform notes.

protocol_id property

protocol_id

FK into the protocol table.

started_at property

started_at

ISO 8601 UTC start timestamp.

__repr__ method descriptor

__repr__()

Return repr(self).

Instrument

A capture instrument (microphone, interface) and its optional calibration. Returned by Project.instruments() / get_instrument().

__doc__ class-attribute

__doc__ = 'A capture instrument (microphone, interface) and its optional\ncalibration. Returned by `Project.instruments()` / `get_instrument()`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

calibration property

calibration

Calibration, if the instrument has been calibrated.

created_at property

created_at

ISO 8601 UTC creation timestamp.

extra property

extra

Freeform JSON payload.

id property

id

Instrument id (primary key).

kind property

kind

Kind label (e.g. "microphone").

name property

name

Human-readable name.

serial property

serial

Serial number.

__repr__ method descriptor

__repr__()

Return repr(self).

Calibration

Microphone / signal-chain calibration mapping dB-FS to dB-SPL. Construct with Calibration(reference_spl_db=…, reference_db_fs=…).

__doc__ class-attribute

__doc__ = 'Microphone / signal-chain calibration mapping dB-FS to dB-SPL.\nConstruct with `Calibration(reference_spl_db=…, reference_db_fs=…)`.'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

reference_db_fs property

reference_db_fs

dB-FS measured for that tone.

reference_spl_db property

reference_spl_db

SPL of the calibration tone (dB-SPL).

__new__ builtin

__new__(*args, **kwargs)

Create and return a new object. See help(type) for accurate signature.

__repr__ method descriptor

__repr__()

Return repr(self).

spl_offset_db method descriptor

spl_offset_db()

dB added to a dB-FS reading to obtain dB-SPL.

to_spl method descriptor

to_spl(db_fs)

Converts a relative dB-FS value to calibrated dB-SPL.

ProcessingRun

One row of a bundle's provenance timeline — an analysis that ran on the bundle. Returned by Project.processing_runs(bundle_id).

__doc__ class-attribute

__doc__ = "One row of a bundle's provenance timeline — an analysis that ran on\nthe bundle. Returned by `Project.processing_runs(bundle_id)`."

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

bundle_id property

bundle_id

Bundle the run targeted.

finished_at property

finished_at

ISO 8601 UTC finish timestamp, if recorded.

id property

id

Processing-run id (primary key).

kind property

kind

dsp_algorithm | ml_model | clinical_measure | plugin | live_recording.

output_tier_ids property

output_tier_ids

JSON array of produced tier ids, if any.

parameters property

parameters

JSON parameters (processor-specific shape), if any.

processor_id property

processor_id

Reverse-DNS processor id, e.g. sadda.dsp.pitch.autocorrelation.

processor_version property

processor_version

Sadda version at run time.

started_at property

started_at

ISO 8601 UTC start timestamp.

status property

status

ok | error | partial.

__repr__ method descriptor

__repr__()

Return repr(self).

Citation

A literature reference for an analysis a bundle used. Returned by Project.citations(bundle_id), suitable for a paper's reference list.

__doc__ class-attribute

__doc__ = "A literature reference for an analysis a bundle used. Returned by\n`Project.citations(bundle_id)`, suitable for a paper's reference list."

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__module__ class-attribute

__module__ = 'sadda._native'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

__sadda_stability__ class-attribute

__sadda_stability__ = 'stable'

str(object='') -> str str(bytes_or_buffer[, encoding[, errors]]) -> str

Create a new string object from the given object. If encoding or errors is specified, then the object must expose a data buffer that will be decoded using the given encoding and error handler. Otherwise, returns the result of object.str() (if defined) or repr(object). encoding defaults to sys.getdefaultencoding(). errors defaults to 'strict'.

doi property

doi

Bare DOI, if one exists.

processor_id property

processor_id

The processor this cites (matches ProcessingRun.processor_id).

reference property

reference

Human-readable reference string.

__repr__ method descriptor

__repr__()

Return repr(self).