At Zipphy, I worked on solving similar problems in on-prem environments — building an OCR + NLP + CV pipeline to generate spatial layouts and classify documents at scale.
One persistent challenge was generalizing across “wild” PDFs, especially multi-page tables.
Your mention of agentic OCR correction and semantic chunking really caught my attention. I’m curious — how did you architect those to stay consistent across diverse layouts without relying on massive rule sets?