✅ Here's your fully upgraded multimodal form-filling agent:
๐ Download form_filler_agent_final.zip
https://drive.google.com/file/d/1G4CYh81p9GYRPEfQYmgJ_SCw317isYUL/view?usp=sharing
๐ New Features
Feature | Description |
---|---|
✅ Chrome GUI | Uses full Chrome browser (non-headless) |
✅ Form Auto-Submission | Detects and clicks submit buttons automatically |
✅ Multi-field Label Mapping | Enhanced matching using <label for="id"> and field type |
✅ Logging Interface | Displays every action/decision after form fill |
✅ DOCX, PDF, Text Support | Upload various formats — auto-extracted content |
✅ Dockerized | Easily scalable, portable containerized app |
๐งช To Use
-
Start Ollama locally with lemma:
ollama run lemma
-
Build Docker image:
docker build -t form-filler-gui .
-
Run container with Chrome GUI:
docker run -p 5000:5000 --add-host=host.docker.internal:host-gateway form-filler-gui
-
Open http://localhost:5000 in your browser.
Let me know if you want:
-
Parallel batch form processing
-
Cloud-based autoscaling (e.g. using FastAPI + Celery + Redis)
-
Form filling across authenticated sessions
-
Exportable logs and filled reports
Ready to help you scale it!
No comments:
Post a Comment