Arjun Kirti Patel

Developer Advocate
Chicago, Illinois

Pinecone
Webinar
Youtube
Github
Demo

Contextual Retrieval with Pinecone and Anthropic

Authored "Contextual Retrieval with Pinecone and Anthropic", a technical breakdown. Applied the technique to a real-world, novel problem: asking visual/audio/image questions of webinar slideshows. Recorded a live webinar with over 900 registrants and 300+ attendees, and over 1k views on YouTube.

Contextual RAG
Multimodality
Pinecone
Anthropic
AWS Bedrock
Streamlit
Pinecone
Article
Github
Demo

The Practitioner's Guide to E5

Authored "The Practitioner's Guide to E5", a walkthrough of Pinecone's landmark Inference endpoint. Gives a complete walkthrough of multilingual embedding models and using Pinecone. Created a novel example demonstrating monolingual, crosslingual, and multilingual search.

Multilingual Semantic Search
Embedding Models
Pinecone
Jupyter Notebook
Pinecone
Article

Refine Retrieval Quality with Pinecone Rerank

Created "Refine Retrieval Quality with Pinecone Rerank", a technical breakdown. Top result on Google for "refine with rerank", along with the AI Overview from Google.

RAG
Reranking
Pinecone
Jupyter Notebook
Hugging Face
Pinecone
Webinar
Youtube

The Magic of Multilingual Search

Hosted "The Magic of Multilingual Search" webinar. Demonstrated how Pinecone's multilingual search can be used to search across multiple languages. Took a first-principles approach to multilingual embedding models and search.

Multilingual Semantic Search
Embedding Models
Pinecone
Jupyter Notebook
Hugging Face
Pinecone
Youtube
Webinar

RAGBrag interview series with CEOs using Pinecone

Produced the RAGBrag interview series with CEOs using Pinecone. Interviewed CEOs from companies like Pinecone, Anthropic, and Cohere. Featured on Pinecone's blog.

RAG
Pinecone
Public Speaking
Personal
Live Presentation
Github

Hypothesis Sage

CLI application that conducts agentic RAG over a statistical knowledge database to identify statistical tests and methods for a given question, create examples of test application, and explain statistical concepts. Uses LlamaIndex Workflows, Cohere (tool use), Pinecone, Typer (CLI), and structured generation using Llama 2 deployed on Baseten to conduct specific actions given user queries. Presented as a talk at AI Tinkerers Chicago, November 2024.

RAG
Agentic RAG
Pinecone
Typer
Llama 2
Baseten
LlamaIndex Workflows
Personal
Github
Demo

Alakazam-VGC

Web application that assists competitive Pokémon VGC players in querying Pokémon data and performing speed checks. Uses LLMs to parse natural language queries into JSON dicts, then passes them to functions for calculation. Built using Hugging Face, LangChain, Supabase, Modal, and Streamlit.
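The parse-then-dispatch flow described above can be illustrated with a minimal sketch. This is not the project's code: the JSON schema, the `HANDLERS` registry, and the function names are hypothetical, and the LLM call is replaced by a hard-coded string. The stat calculation uses the standard main-series formula for non-HP stats.

```python
import json

# Nature multipliers as integer percentages, so flooring stays exact.
NATURE_PERCENT = {"boosting": 110, "neutral": 100, "hindering": 90}


def speed_stat(base, iv=31, ev=0, level=50, nature="neutral"):
    """Compute a Speed stat with the standard non-HP stat formula."""
    raw = (2 * base + iv + ev // 4) * level // 100 + 5
    return raw * NATURE_PERCENT[nature] // 100


def speed_check(a, b):
    """Compare two spreads (hypothetical schema) and report which is faster."""
    speed_a, speed_b = speed_stat(**a), speed_stat(**b)
    if speed_a == speed_b:
        faster = "speed tie"
    else:
        faster = "a" if speed_a > speed_b else "b"
    return {"a": speed_a, "b": speed_b, "faster": faster}


# Hypothetical registry: maps the "function" field of the parsed query
# to the Python callable that performs the calculation.
HANDLERS = {"speed_check": speed_check}


def dispatch(llm_output):
    """Parse the model's JSON reply and route it to the matching handler."""
    query = json.loads(llm_output)
    handler = HANDLERS[query["function"]]
    return handler(**query["args"])


# Stand-in for what an LLM might return for the question
# "Does max-speed Garchomp outrun neutral-nature Hydreigon?"
llm_output = json.dumps({
    "function": "speed_check",
    "args": {
        "a": {"base": 102, "ev": 252, "nature": "boosting"},  # Garchomp
        "b": {"base": 98, "ev": 252},                         # Hydreigon
    },
})

result = dispatch(llm_output)
print(result)
```

Keeping the arithmetic in plain functions and using the model only to produce structured arguments means the calculations stay deterministic and can be tested without calling an LLM.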

LLM Application Development
Public Speaking
Finetuning LLMs
LLM Evaluation
Streamlit
Modal
Hugging Face
Model Deployment
Structured Generation
Personal
Github
Demo

Perfect Prompt

First-place winner in Cohere Hackathon #3 2022. Takes user prompts in a Streamlit interface and identifies artistic style using Cohere classification. Returns optimized prompts for iterative development from finetuned Cohere generation endpoints.

LLM Application Development
Public Speaking
Finetuning LLMs
Prompt Engineering
Streamlit
Cohere
Hugging Face
Appen
Article

Deciphering AI from Human Generated Text: The Behavioral Approach

Coauthored a blog post giving an overview of research published at HCOMP 2023.

AI Generated Content Detection
NLP
RAG
Experimental Design
Data Collection
Appen
Article

Navigating the AI Detection Landscape

Coauthored a blog post on experimental design and the landscape of third-party AI content detection tooling. Overviews the results of a custom internal benchmark on detecting AI-generated content.

AI Generated Content Detection
NLP
Experimental Design
Benchmarking
Appen
Article

Boosting Data Quality with Appen's Human-centric AI Detector Model

Produced the results for, and coauthored, a blog post on applying internal AI detection models to the real-world curated dataset from the aforementioned experiment. A follow-up to the HCOMP study on a large-scale dataset, demonstrating a several-fold improvement in detection accuracy using behavioral techniques.

AI Generated Content Detection
NLP
Experimental Design
Benchmarking
Machine Learning