# Book Explorer RAG — Go Backend
A Retrieval-Augmented Generation (RAG) API built with Go, Pinecone, and Google Gemini for querying a dataset of 2,032 books from Goodreads.
## Tech Stack

- Go + Gin — HTTP server
- Pinecone — vector database (cosine similarity, 3072 dimensions)
- Google Gemini — embeddings (`gemini-embedding-001`) + LLM (`gemini-2.5-flash`)
## Project Structure

```
backend/
├── cmd/
│   ├── server/main.go     # API server entry point
│   └── seed/main.go       # Seed data into Pinecone (supports --resume)
├── config/config.go       # Environment config loader
├── models/types.go        # All data structs
├── services/
│   ├── embedding.go       # Gemini embedding generation
│   ├── pinecone.go        # Pinecone query, upsert & stats
│   ├── llm.go             # Gemini LLM answer generation
│   └── book.go            # Book record flattener
├── handlers/ask.go        # /ask endpoint handler
├── middleware/cors.go     # CORS middleware
├── data/
│   └── books.json         # 2,032 books with descriptions & genres
└── .env                   # API keys (not committed)
```
## Setup

- Create a `.env` file:

```
GOOGLE_API_KEY=your-google-api-key
GEMINI_API_URL=https://generativelanguage.googleapis.com/v1/models/gemini-2.5-flash:generateContent
PINECONE_API_KEY=your-pinecone-api-key
PINECONE_HOST=https://your-index.svc.pinecone.io
PORT=8090
```
- Install dependencies:

```
go mod tidy
```

- Seed Pinecone with book data:
```
go run ./cmd/seed
```

The Gemini free tier allows roughly 1,000 embeddings per API key per day, so seeding all 2,032 books takes two passes:

```
# First run — embeds ~1,000 books, then hits the rate limit
go run ./cmd/seed

# Swap the API key in .env, then resume from where it stopped
go run ./cmd/seed --resume
```

- Start the server:
```
go run ./cmd/server
```

## Usage

Send a POST request to `/ask`:

```json
{
  "query": "Recommend me a classic novel about justice"
}
```

Response:

```json
{
  "answer": "To Kill a Mockingbird by Harper Lee is a classic novel about justice..."
}
```

## How It Works

- User query is converted to a 3072-dim vector via Gemini embedding
- Pinecone finds the most similar book records via cosine similarity
- Matched records are passed as context to Gemini LLM
- LLM generates a grounded answer based only on the retrieved context