GT Gilligan Tech
Platform · AWS Bedrock

Open-weight models. Managed infrastructure. No lock-in.

AWS Bedrock gives Gilligan Tech access to both proprietary and open-weight models through a single managed API — with no per-seat licensing, full model transparency, and the security posture that regulated industries demand. For data-sensitive deployments, Llama on Bedrock is a privacy-first choice with no vendor training on your data.

30+
Foundation Models
One API — access to Claude, Llama, Titan, Mistral, and more.
100%
Managed Infra
No GPU provisioning, no model serving, no ops overhead.
0
Training on Your Data
Bedrock's API calls are never used to retrain any model.
SOC 2
Compliance
AWS Bedrock is SOC 2, ISO 27001, HIPAA, and GDPR compliant.
Capabilities

What AWS Bedrock enables for your business.

Open-Weight Model Access
Llama · Mistral · Full model transparency

Open-weight models on Bedrock give clients full visibility into model architecture and weights — critical for regulated industries where "black box" AI is a compliance risk. Llama 3.1 on Bedrock combines open-source transparency with AWS's enterprise security.

  • Meta Llama 3.1 70B — Open-weight reasoning for privacy-sensitive deployments
  • Meta Llama 3.1 8B — Cost-efficient open-weight inference for high-volume tasks
  • Mistral Large 2 — European-origin model with strong multilingual capability
📊
Enterprise Embeddings
Titan Embeddings · Cohere Embed · Vector search

Amazon Titan Text Embeddings V2 powers the vector retrieval layer in our RAG pipelines — turning every document corpus into a semantically searchable knowledge base that returns conceptually relevant results, not just keyword matches.

  • Titan Text Embeddings V2 — 1,024-dimension semantic embeddings for RAG
  • Cohere Embed v3 — Multilingual embeddings + reranking for precision retrieval
  • Amazon OpenSearch — Managed vector store with k-NN search, AWS-native
Cost-Optimised Inference
Nova Micro · Spot inference · Batch processing

For high-volume classification, extraction, and triage tasks, Bedrock's low-cost models — Amazon Nova Micro in particular — cut inference costs dramatically without sacrificing accuracy on structured tasks.

  • Amazon Nova Micro — Ultra-low-cost inference for classification and extraction
  • Amazon Nova Lite — Balanced speed and cost for moderate-complexity tasks
  • Bedrock Batch Inference — Async bulk processing at up to 50% cost reduction
🔒
Security & Data Isolation
VPC isolation · KMS encryption · Private endpoints

Bedrock runs inside your AWS VPC boundary with no data egress to the public internet. Combined with KMS-managed encryption and VPC endpoints, it satisfies even the most demanding enterprise and regulated-industry security requirements.

  • VPC Endpoints — Private connectivity; inference traffic never hits public internet
  • AWS KMS Encryption — Customer-managed keys for data at rest and in transit
  • CloudTrail Logging — Every API call logged for audit and compliance
Architecture

How Gilligan Tech deploys AWS Bedrock.

  1. Model routing: Incoming tasks are classified by complexity and data-sensitivity. Open-weight models (Llama) are selected for privacy-sensitive workloads; Titan for embedding generation; Nova for bulk classification.
  2. Document ingestion: Source documents are chunked, embedded via Titan Text Embeddings V2, and stored in Amazon OpenSearch or a compatible vector store. Chunk metadata is preserved for citation.
  3. Retrieval: User queries are embedded and matched against the vector store using approximate nearest-neighbour search. Top-ranked chunks are assembled into the LLM prompt context.
  4. Inference: The assembled prompt is sent to the appropriate Bedrock model. For privacy-critical clients, Llama 3.1 70B is used exclusively — no third-party model vendor receives the data.
  5. Audit trail: CloudTrail captures every Bedrock API call. Gilligan Tech's platform layer adds application-level logging of model, latency, token count, and retrieved sources.
Model Reference

AWS Bedrock models we deploy.

ModelProviderBest for
Llama 3.1 70B InstructMeta (via Bedrock)Privacy-first reasoning; regulated industry deployments
Llama 3.1 8B InstructMeta (via Bedrock)Cost-efficient open-weight inference for high-volume tasks
Amazon Nova MicroAmazonUltra-low-cost classification, tagging, extraction
Amazon Nova LiteAmazonBalanced cost and capability for production workloads
Titan Text Embeddings V2AmazonSemantic embeddings for RAG retrieval pipelines
Cohere Embed v3Cohere (via Bedrock)Multilingual embeddings + reranking for precision retrieval
Mistral Large 2Mistral AI (via Bedrock)European-origin multilingual reasoning

Deploy open-weight AI on your data.

We'll show you how Llama on Bedrock can power your document Q&A or support automation — with full data sovereignty and no black-box concerns.