HACKER Q&A
📣 albert_e

How to build a private version of Code Interpreter


I have decent / beginner level understanding of ML and a bit of hands on experience using algorithms and training models; but have not jumped itno the LLM / GenAI whirlpool yet.

The latest "code intepreter" release from GenAI intrigues me.

The "Data Analyst" capabilities demoed look powerful and will definitely generate a lot of interest -- as it replaced the dependency, cost, and time of using analysts and interns for basic tasks. I am sure there are much more capabilities than basic analysis that will emerge in future building on top of this.

In current avatar it is obviously a no-no for enterprise customers and those with private/sensitive data to analyze.

Enterprises would obviously want a version of this model and functionality that is run on-prem -- or in their own private cloud or their own account in public cloud -- so that their confidential data never leaves their network or is shared with any third-party.

I want to use this idea as an opportunity to learn about LLMs and attempt to build such a tool -- in however small a scale / limited doamin -- as a proof of concept.

Appreciate any advice on where one should start to go about building such a tool and interface? Thanks!