Historically, yes - uppercase came first. Even when lowercase showed up, the key combination needed to produce lowercase letters was harder to use, so it took more time for them to catch up.
This is a Google Colab notebook (it works in Jupyter too). The notebook can connect to live data sources (e.g. Kafka) and you can run your analysis on them. You could also do a CSV replay if you have timestamped data entries.
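If it helps, here is a minimal sketch of what such a CSV replay could look like in plain Python (no specific framework assumed; the file name, column name, and speedup factor are made up for illustration):

```python
import csv
import time
from datetime import datetime

def replay_csv(path, speedup=60.0):
    """Yield rows of a timestamped CSV in order, sleeping between rows so the
    replay roughly follows the original event spacing (scaled down by `speedup`)."""
    with open(path, newline="") as f:
        # Assumes an ISO-8601 "timestamp" column; adapt the parsing to your data.
        rows = sorted(csv.DictReader(f),
                      key=lambda r: datetime.fromisoformat(r["timestamp"]))
    prev_ts = None
    for row in rows:
        ts = datetime.fromisoformat(row["timestamp"])
        if prev_ts is not None:
            time.sleep(max((ts - prev_ts).total_seconds() / speedup, 0.0))
        prev_ts = ts
        yield row

# Feed the replayed rows into whatever analysis you would run on the live source.
for event in replay_csv("events.csv"):
    print(event)
```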
Jan, can you explain briefly how the deduplicator checks if the new answer is significantly different? Is there code in the repository we can take a look at?
Sure: when a new response is produced because some source documents have changed, we ask an LLM to compare the two responses and tell us whether they are significantly different. Even a simplistic prompt, like the one used in the example, would do:
Are the two following responses deviating?
Answer with Yes or No.
First response: "{old}"
Second response: "{new}"
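As a rough sketch of how that check could be wired up (this is illustrative, not the actual repository code; it assumes the OpenAI chat API and a placeholder model name):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

DEDUP_PROMPT = (
    "Are the two following responses deviating?\n"
    "Answer with Yes or No.\n"
    'First response: "{old}"\n'
    'Second response: "{new}"'
)

def is_significantly_different(old: str, new: str) -> bool:
    """Ask the LLM whether the new answer deviates from the old one."""
    reply = client.chat.completions.create(
        model="gpt-3.5-turbo",  # illustrative model choice
        messages=[{"role": "user",
                   "content": DEDUP_PROMPT.format(old=old, new=new)}],
    )
    return reply.choices[0].message.content.strip().lower().startswith("yes")
```

The pipeline would then only push the updated answer onward when this returns True.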
That's a good idea. The deduplication criterion is easy to change: using an LLM is faster to get started with, but after a while a corpus of decisions is created and can be used either to select another mechanism or, e.g., to train one on top of BERT embeddings.
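For example, once that corpus exists, a drop-in alternative could be an embedding-similarity check like the sketch below (the sentence-transformers model and the threshold are placeholders; in practice you would tune or learn the threshold from the logged decisions):

```python
from sentence_transformers import SentenceTransformer
from numpy import dot
from numpy.linalg import norm

model = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative embedding model

def is_significantly_different(old: str, new: str, threshold: float = 0.85) -> bool:
    """Flag the new answer as a deviation when its embedding similarity
    to the old one drops below the threshold."""
    old_vec, new_vec = model.encode([old, new])
    similarity = dot(old_vec, new_vec) / (norm(old_vec) * norm(new_vec))
    return similarity < threshold
```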
I feel that there are too many moving pieces here, especially for prototyping. There was a much simpler app I took a look at in a recent Hacker News post: https://news.ycombinator.com/item?id=36894142
They still have work to do on different connectors (e.g. PDF, etc.), but the simple real-time document pipeline is what helps a lot.