Skip to content

feat(lapis): add llms.txt#1551

Closed
fhennig wants to merge 4 commits intomainfrom
llms-txt
Closed

feat(lapis): add llms.txt#1551
fhennig wants to merge 4 commits intomainfrom
llms-txt

Conversation

@fhennig
Copy link
Contributor

@fhennig fhennig commented Feb 20, 2026

resolves #1544

I've added a llms.txt that lists the metadata fields and genes that are used, gives some basic API info and also links to the docs, OpenAPI spec and Swagger.

I've decided to template this with Thymeleaf, and also cleaned up the landing page HTML to use templating like that as well. It's a new dependency, but part of the Spring Stack already, so I believe it's not a big deal, and it makes the code much nicer.

I ran this locally, and it worked!

Links in the md file are obviously not clickable, but I used root paths so hopefully an LLM would understand how the links are supposed to work.

PR Checklist

  • All necessary documentation has been adapted.
  • The implemented feature is covered by an appropriate test.

@vercel
Copy link

vercel bot commented Feb 20, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
lapis Ready Ready Preview, Comment Feb 20, 2026 9:31pm

Request Review

@fhennig fhennig self-assigned this Feb 20, 2026
@fhennig fhennig changed the title Llms txt feat(lapis): add llms.txt Feb 20, 2026
@fhennig fhennig marked this pull request as ready for review February 20, 2026 21:28
Copy link
Member

@chaoran-chen chaoran-chen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR! Just a few quick and spontaneous thoughts

Comment on lines +29 to +31
- Use any metadata field as filter (e.g., `country=Ghana`, `date>=2021-01-01`)
- `nucleotideMutations` - Array of mutations (e.g., `A123T`, `main:G234C`)
- `aminoAcidMutations` - Array of mutations (e.g., `S:501Y`, `ORF1a:3456K`)
Copy link
Member

@chaoran-chen chaoran-chen Feb 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could those examples (and those below) be problematic / misleading for instances that don't have these fields / segments / sequences?

Comment on lines +104 to +107
- Filters and fields are instance-specific based on database configuration
- Check `/sample/databaseConfig` for full configuration details
- Check `/sample/referenceGenome` for reference genome structure
- Some instances may require OAuth authentication (check with unauthenticated request)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be better to list all the fields directly in this file? The databaseConfig may not be very clear to understand.

Copy link
Member

@chaoran-chen chaoran-chen Feb 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Have you tried it with different LLMs? I'd be curious to see how they perform. E.g. whether they know how to perform range searches, regexes, use advanced queries etc.

@fengelniederhammer
Copy link
Contributor

As discussed - I incorporated the changes in #1553

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

add llms.txt?

3 participants