LookAside¶
如果说LLM带来的机会是如同互联网或者移动手机的级别的话,那LLM扮演的将会是一种全新处理信息的方式,而不是一种自动化手段。 因为传统的系统需要explicitly对事物逻辑进行建模,i.e. relational schema,而LLM则implicitly对事物逻辑进行建模,i.e. model weight。
如果说过去20年的搜广推解决了信息触达的问题,那么为什么人们还会有信息茧房呢? 因为瓶颈是在信息的处理和整理上, which is why we build lookaside。
具体而言,LookAside解决几个核心问题:
- Capture
- Organize
- Output
Why, Who and How?¶
User persona: knowledge workers with doing research as the center of the worklow.
Current Offerings¶
- Cubox
- Protolyst
- Elicit: Find research paper to cite and define reserach directions.
- Infranodus
Alternative solutions:
- Note-taking Apps
- Bookmark Apps
- Read-it-later / Reading Apps
How to establish a scaling advantage?¶
How to make LookAside 5% better every week?¶
How to solve the cold-start problem?¶
Short Term Planning¶
- Do a comprehensive research on existing researching tools
- what do they offer and what's their understanding of the problem?
- Derisk the intelligence features, i.e. prompt engineering
- Evaluate relevant offerings
- Implement LlamaIndex with Vercel + Supabase + Modal
- Derisk the objective decompose capability
- Study Relevant Projects' Codebase
Relevant Researching Tools:
- Guides & Notes by Genei
- Cubox
- UpWord on Knowledge Works powered by AI
- Elink on bookmarking for content creation
- Elicit on AI assisted research
Relevant Projects:
Relevant Github Projects:
- LlamaIndex -- How LlamaIndex chunk the docs?
- AutoGPT -- How LLM decomposes an objective?
- gpt-engineer -- How users interact with the system
- embedchain -- How it load youtube videos?
- aider -- UX logic with LLMs
- Danswer -- QA query on private data
- EasyEdit -- Modify LLMs to align with intent
- AutoDoc -- Auto generate Docs based on codebase
- khoj -- NBX for Emacs & Obsidian
- llm_agents -- Study how LLM agent works
- Customer Service GPT -- How to use process to drive GPT models
- ArxivDigest -- How to integrate the pipeline from Arxiv
- gpt-researcher -- How to orchestrate differnt agents
Intelligence Features¶
- QA bot based on the content of a set of records (LlamaIndex)
- Decompose an objective into smaller more actionable ones (AutoGPT)
- Consolidate two objectives into one objective
- Reason the object's viability based on the collected information
- Reason the gap between the available information and the objective and ask for more information (gpt-engineer)
- Topic modeling based on the content of the saved records
Process to learn a codebase¶
- Identify a typical user workflow (E2E tests)
- Identify the key subsystems and how they are composed together
- Identify the key APIs & data structures of the system
Questions to answer¶
- Objective和Task的关系和区别是什么?
- Annotation/Capture, whether having it really matters or not?
- Record的多种形态:Webpage,Notes,Files
Vision¶
LookAside is an attempt to rethink how knowledge workers work in the LLM area.
- Human should focus on collecting the right information, LLM would do the work automatically.
- Evidence: Elink create content automatically
- User owns the objective, it cannot be delegated to the system
Power of Random Thoughts¶
Putting the right information in the right place is the key to remedy information anxiety.
How to find seed users?¶
- VCs are idea customers but probably not the best for seed users, as they don't keep the secret.
- Ideal seed users would be the people that won't raise legal issues with me, i.e. copyright
- Friends, but they care about their data and privacy
- Users who are interested in similar projects
How to formulate the feedback loop of objective modeling?¶
Say in feteching relevant objectives, what if the user knows a better candidate than the system, how do we capture such feedback as input?
Similar Projects Analysis¶
Protolyst¶
The key value provided by Protolyst is making the evolution of each idea or claim back-trackable.
Go To Market¶
Youtube Influencers