But is there actually an API for that? Last I checked the big providers Video Intelligence APIs even distinguishing cats and dogs was still unreliable.
Unless I am missing something massive, BirdNET[0] is for identifying birds by sound, not by images.
Merlin[1] (also from Cornell Lab of Ornithology), on the other hand, has both image and sound ID. I haven't used either, so I cannot compare the quality of results from Merlin vs. BirdNET for sound ID, but afaik only Merlin has image ID.