Skip to main content

Frequently Asked Questions

Quick answers about pricing, recording equipment, and our service

Pricing

We use simple pay-per-use pricing. Our ML processes at faster than realtime speed, so you pay a fraction of traditional analysis costs. Our costs are so low we have to offer it to you in hours to make sense.

See current rates and examples on our Pricing page — prices shown in your local currency.

Your audio processes faster than realtime speed through an our ML. We seriously know Secure Enterprise Cloud at scale. And, we charge you by the minute only for the actual compute used for your audio.

Yes! After uploading, you'll see the exact cost before checkout. We use Stripe for secure payment processing. No surprises, no hidden fees.

We accept all major credit cards via Stripe: Visa, Mastercard, American Express, and more. Prices are shown in AUD and converted to your local currency at checkout.

Yes! We are very confident in our algorithm and service, so we give you 10 free ML mins of free ML time for new users. It's easy to check out — try Record Now or just upload a short recording. Honestly, we found some funny noises that can be detected but we won't spoil your fun...

Recording

You can use devices you already own:

  • Smartphone - Any old phone works great as a dedicated recorder
  • Digital voice recorder - Sony/Olympus from $30-80 work perfectly
  • Security camera - Ring, Nest, Arlo, Wyze all record audio
  • PC/Laptop - Record with Audacity (free) or any audio software

Yes! Logged-in users can use our Record Now subcomponent to capture audio directly from their device's microphone with GPS location tracking. Great for on-the-go noise documentation.

We accept MP3, WAV, M4A, FLAC, OGG, WebM, and most common audio/video formats. Our system extracts audio from video files automatically.

Most security cameras let you download clips from their app. Export as MP4 when possible:

  • Ring - Select event → Share → Save Video (MP4)
  • Nest/Google - Google Home app → View event → Download (MP4)
  • Arlo - Select clip → Share → Save to Device (subscription required)
  • Wyze - View event → Download (MP4)
  • Eufy - Select clip → Download (app only, no web)
  • Blink - Select clip → Download, or backup to USB via Sync Module
  • SimpliSafe - Select clip → Download (one at a time)
  • Reolink - Download via app or Reolink Client software (MP4)
  • TP-Link Tapo - View clip → Download (local WiFi only)
  • Swann - Export via SwannView software (MP4 or AVI)

Pro/NVR systems (Hikvision, Dahua, UniFi, Lorex): Export to MP4 format using the manufacturer's software. Proprietary formats (DAV, UBV) are not supported.

Upload the exported video directly - we extract audio automatically from all standard formats.

Integrations

Yes! Our REST API supports automated uploads from any system that can make HTTP requests:

  • Home Assistant - Use rest_command to trigger analysis on noise detection
  • Raspberry Pi - Script continuous monitoring with auto-upload
  • NAS/NVR - Batch upload recordings with our API
  • Frigate - Webhook integration for event-based analysis

See our Integrations page for setup guides.

You can build a continuous monitoring station for under $100 using:

  • Raspberry Pi + USB mic - Record in 30-min chunks, auto-upload to API
  • ESP32 + I2S mic - Ultra-low-power, battery/solar viable
  • Old smartphone - Free if you have one, use Tasker for automation

Our Edge Devices guide has full setup instructions.

Yes! Add our REST command to your Home Assistant config and trigger analysis automatically:

  • Camera noise detection triggers recording
  • Recording uploads to H‑ear API
  • Optional webhook callback when analysis completes
  • Results available in dashboard or via webhook

See the Home Assistant tab on our Integrations page.

When submitting audio via API, you can include a callbackUrl. We'll POST results to your server when analysis completes, including HMAC-SHA256 signature verification. Great for automation workflows where you need to act on results immediately.

Yes! Our batch API endpoint accepts up to 50 files in a single request. Perfect for:

  • Processing SD card dumps from security cameras
  • Bulk import from NAS or cloud storage
  • Nightly batch processing of accumulated recordings

See Code Samples for batch examples.

General

Audioset based models (YAMNet, PANNS) can classify approximately 521 different sound types including:

  • Dog barking, cat meowing, animal sounds
  • Traffic, horns, vehicle engines
  • Aircraft, helicopters, drones
  • Construction equipment, power tools
  • Music, voices, TV/radio
  • Environmental sounds (wind, rain, birds)

We use our H-ear algorithm to mathematically analyze the ML in a fully reproducible manner. H-ear algorithmically considers temporal probability of connectivity between raw ML sound events. H-ear enables richer context of the world around us via sound annotation, e.g. that was a dog barking, a vehicle drove by, a noise was on but is now off, etc.

Analysis runs at faster than realtime. You'll receive a notification via email when complete.

Yes. Audio files are encrypted in transit and at rest. Files are processed in isolated containers. We never share your data with third parties. Your data and privacy are respected and secured with the highest technical and privacy consideration.

Please check in with the Help & Status page. Please do contact our support team with any questions or concerns with our service. If you need help in any way or you have any feedback whatsoever, please do not hesitate to get in touch with the humans in the backend at support@h-ear.world.

H-ear your Environment

Play the audio. Interact with the annotation timeline. Download 100% real output and compare H-ear noiseEvents versus ML rawPredictions (we give you both).

0 / 1m 1s
21/26
Standard
Detail
Wild animaSnoringFrog
002:31:49 AM8.8s02:31:58 AM17.6s02:32:07 AM26.3s02:32:15 AM35.1s02:32:24 AM43.9s02:32:33 AM52.7s02:32:42 AM1m 1s02:32:51 AM
Analysis
Marker
Leaflet © OpenStreetMap contributors
Job ID: demo-job
26
Total Events
26
Total Events
40.3
Avg dB
40.3
Avg dB
62.0
Max dB
62.0
Max dB
70%
Avg Confidence
70%
Avg Confidence
YAMNet
Model
YAMNet
Model
Detected Sounds
Animal: 7
Human sounds: 5
Source-ambiguous sounds: 4
Sounds of things: 6
Music: 4
Top Noise Sources

1. Animal > Livestock, farm animals, working animals > Fowl

2 events · 5.8s · 100% conf
Fowl_15

2. Human sounds > Respiratory sounds > Breathing

1 events · 3.8s · 100% conf
Breathing_1

3. Animal > Wild animals > Frog

1 events · 2.9s · 100% conf
Frog_0
Snippet Details
Snippet ID
demo-snippet
Original Filename
demo-60s-fixture-1.mp3
Duration

62.277s (1m 2s)

File Size

973.9 KB

Source Type

Upload

GPS Location
Latitude

-35.250830

Longitude

149.049271

Accuracy

212m

GPS Timestamp

7 Apr 2:31 am

GPS Source

browser

Timezone

Australia/Sydney

Timestamps
Recording Started

7 Apr 2:31 am

Recording Ended

7 Apr 2:32 am

Created At

10 Apr 7:11 pm

Updated At

10 Apr 7:11 pm

Ready to get started?