Gemma3 1b instruct IQ4_NL from local GGUF server using BPP library

Gemma3 1b instruct is an open-source LLM supporting a 128k context window. This demo uses only 2K context.

The BPP library implements matrix multiplication with far less multiplications.

0 1
0 2
0 1
0 100
1 2000