Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Accessibility API #49

Open
jlia0 opened this issue Sep 21, 2023 · 5 comments
Open

Accessibility API #49

jlia0 opened this issue Sep 21, 2023 · 5 comments

Comments

@jlia0
Copy link

jlia0 commented Sep 21, 2023

Is this connected to the accessibility API to retrieve context information (like url, app name, etc) yet?

@apirrone
Copy link
Owner

Hi, I tinkered a bit with the accessibility API to try to extract the text directly from the apps instead of using an OCR, but did not achieve much.
If you have any good reference material it would be great, or you can make a PR :)

@jlia0
Copy link
Author

jlia0 commented Sep 21, 2023

Here's one that cyte2 was referencing from: https://github.com/tmandry/AXSwift

However I am not sure if there is a Python API for it, do you mind sharing some of your tinkering code?

I believe we still need OCR for extracting the text, the accessibility api is for extracting "metadata" like url or window contexts.

@apirrone
Copy link
Owner

Unfortunately I don't seem to have kept my tinkering code :/ I tried to use ORCA screen reader (https://github.com/GNOME/orca) but it was not the right tool I think

@jlia0
Copy link
Author

jlia0 commented Oct 9, 2023

https://kevinchen.co/blog/rewind-ai-app-teardown/

^^^ I think this would probably help

@apirrone
Copy link
Owner

apirrone commented Oct 9, 2023

Yes this blog post was very helpful :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants