What IT services does KGA provide?

KGA provides comprehensive IT support services including software installation and setup, SaaS system maintenance, application configuration, technical support, digital consulting (including website development), security services, and data management & backup solutions.

What areas do you cover?

Based in Kosai, Shizuoka, we provide remote support nationwide across Japan. On-site support is available primarily in the Tokai region.

Can I consult before signing a contract?

Yes, initial consultation and estimates are completely free. We will listen to your IT challenges and propose the optimal solution.

Is emergency support available?

Yes, the Premium plan includes 24-hour emergency support. The Standard plan also provides priority response during business hours.

Can you set up international TV apps?

Yes, we support the installation and configuration of international TV applications and media players. We help set up environments for legal access to international content.

Do you offer multilingual support?

We support 9 languages: Japanese, English, Portuguese, Korean, Chinese, Malay, Filipino, Vietnamese, and Spanish.

Are there any setup or hidden fees?

No. All prices displayed are final and tax-included. There are no setup fees, hidden charges, or surprise invoices. What you see is exactly what you pay.

Can I change plans later?

Yes. You can upgrade, downgrade, or cancel at any time. Upgrades take effect immediately and we will prorate the difference. Downgrades take effect at the next renewal cycle.

Which payment methods do you accept?

We accept all major credit cards (Visa, Mastercard, JCB, American Express) through Komoju, as well as bank transfers and convenience store payments in Japan. Invoicing is available for Business IT Plan customers.

Do you offer refunds?

Yes. We offer a 14-day money-back guarantee on all annual plans — no questions asked. Monthly Business IT Plan subscriptions can be cancelled at any time with prorated refunds for unused service.

What is the difference between the annual plans and the Business IT Plan?

Annual plans cover app configuration and support for individuals and small teams. The Business IT Plan is a comprehensive monthly subscription for companies that require website development, system management, automation, security, and a dedicated account manager.

Do you provide support in English?

Yes. Our team provides full multilingual support in Japanese, English, Portuguese, Korean, Chinese, Malay, Filipino, Vietnamese, and Spanish — by email, chat, and scheduled video calls.

Cloudflare Workers & Durable Objects Deep Dive: Designing Global State Machines — KGA Tech Blog

A Bird's-Eye View of the Workers Platform

Cloudflare Workers is a FaaS platform built on V8 Isolates. As of April 2026, the same code deploys instantly to 330+ points of presence. Cold starts are effectively zero (under 5ms for most requests), but because Workers uses Isolates rather than container launches as AWS Lambda does, Node.js native modules and long-running blocking I/O are fundamentally at odds with the runtime model. The `nodejs_compat_v2` flag, which went GA in the second half of 2025, dramatically improved compatibility for `fs`, `crypto`, and `stream` — but without understanding the Isolate execution model, you will still collide with the subrequest limit (1,000) or the CPU time limit (5 minutes on paid plans).

Workers reaches its full potential when stateful logic is combined with Durable Objects. A Durable Object (DO) is a "globally unique, single-threaded, strongly consistent actor." Requests to the same DO are always serialized. This model is designed as an edge-side replacement for mutex primitives, enabling use cases such as per-user rate limiting, per-meeting-room signaling, and per-transaction-ID inventory allocation — all without an extra round-trip to a central store.

The Durable Objects Transaction Model

DO persistent storage is backed internally by a SQLite-based storage API. Since SQLite-backed DOs became available to all plans in 2025, each DO can hold up to 10 GB of relational data, and multiple `put` and `delete` operations inside a `transaction()` callback execute atomically. The critical design detail is that DOs have both an input gate and an output gate. The input gate blocks new incoming requests while a storage write is in progress; the output gate holds back the fetch response until the storage write is durably committed. This two-gate design means application code can freely use `async/await` without risking consistency violations.

```typescript export class InventoryDO { state: DurableObjectState; constructor(state: DurableObjectState) { this.state = state; } async fetch(req: Request) { const { sku, qty } = await req.json<{ sku: string; qty: number }>(); return await this.state.storage.transaction(async (tx) => { const current = (await tx.get<number>(sku)) ?? 0; if (current < qty) return new Response("oos", { status: 409 }); await tx.put(sku, current - qty); return new Response(JSON.stringify({ remain: current - qty })); }); } } ```

The most common misuse of DOs in an actor pattern is designing one DO to handle all tenants. A hot DO gets pinned to a single PoP and forces geographically distant users to eat the RTT. The correct approach is to use `idFromName()` to concatenate tenant ID and resource ID for sharding, so writes land near geographic centers of gravity. Reads can be served from replica DOs (GA in 2025) with O(1) latency.

Workers KV, Hyperdrive, and D1: Knowing Which to Use

The three storage layers have clearly distinct purposes. Workers KV is an eventually-consistent, read-optimized store suited for configuration values and metadata where propagation within 60 seconds is acceptable. PUT propagation takes seconds to a minute; reads return from the local PoP cache in 1ms. Hyperdrive is a connection pooler for external Postgres or MySQL, reusing TCP connections to the origin while caching query results at the PoP. It is the most practical first step for organizations migrating an existing RDBMS workload to the edge. D1 is Cloudflare's own SQLite-based relational database; read replicas went GA in 2025, enabling a PostgreSQL-style topology where writes go to the primary and reads hit the nearest replica.

The decision framework breaks down by write frequency and consistency requirements. If writes exceed 1,000 per second and workloads can be tenant-partitioned, use DOs. For few global writes with geographically distributed reads, use D1. For an existing external database, use Hyperdrive. For TTL-based configuration distribution, use KV. R2 (S3-compatible object storage) is orthogonal to all of these and is chosen for video delivery, backups, and data lakes on the strength of free egress.

Workers AI and Inference Routing

Workers AI, GA since 2024, now offers major OSS models — Llama 3.3 70B, Mistral Large 2, GPT-OSS-120B, Qwen 3-series — on a pay-per-use basis. As of 2026, BYOK allows routing OpenAI, Anthropic, and Google calls through a unified API (AI Gateway), with response caching, retry logic, and rate limiting managed on the Workers side. AI Gateway logs stream into Analytics Engine, making per-prompt token costs queryable with SQL.

The standard latency optimization approach is a hybrid: small classification tasks handled by Workers AI local inference, complex reasoning escalated to an external LLM. For example, bot detection, NSFW classification, and language identification run on Llama Guard or Llama-3 8B via Workers AI; only "uncertain" results are escalated to Claude or GPT. KGA has documented multiple cases where this two-stage routing cut costs by 70%.

Hot-Path Optimization with Rust + WASM Bindings

TypeScript at 5ms, Rust + WASM at 0.8ms. That gap becomes relevant to CPU time budgets at 100,000 requests per second. The `workers-rs` Rust crate uses a `#[event(fetch)]` macro to compile to the Workers runtime, and `wasm-bindgen` provides access to JS APIs — fetch, crypto, R2. Rewriting JSON-heavy API gateways, HMAC validation, JWT verification, or image resizing in Rust yields meaningful cost reduction.

```rust use worker::*; #[event(fetch)] async fn main(req: Request, env: Env, _ctx: Context) -> Result<Response> { let body: serde_json::Value = req.json().await?; let signature = req.headers().get("x-signature")?.unwrap_or_default(); if !verify_hmac(&body, &signature, &env.secret("HMAC_KEY")?.to_string()) { return Response::error("invalid signature", 401); } Response::ok("ok") } ```

One caveat: keep WASM module size under 10 MB. Combining `wasm-opt -Oz` with `wee_alloc` typically cuts the binary in half. Cold compilation cost (20–50ms) is negligible thanks to Isolate reuse, but in extremely low-traffic PoPs where Isolates are rarely warmed, TypeScript can paradoxically be faster. Always benchmark on production PoPs.

Building Global State Machines

WebSocket Hibernation combined with Durable Objects has become the standard approach for real-time collaborative editing, live-stream signaling, and IoT device orchestration. The Hibernation API allows WebSocket connections to remain open while releasing memory — billing applies only during active processing time. Sustaining 1 million connections at a cost of a few thousand dollars per month creates the economic case for replacing managed services like Ably or Pusher with self-built infrastructure.

The key design principle is treating each DO as a single node in a state machine. State transitions CAS-update a `state version` inside a `storage.transaction`, external events are fired through Queues, and compensating transactions for failures are scheduled via the `alarm()` API. This makes it possible to implement a saga pattern — beginning at the edge and ending at the edge — at low latency without touching an origin server. Architecture that never leaves the edge is the defining challenge of distributed systems design in 2026.

Cloudflare Workers & Durable Objects Deep Dive: Designing Global State Machines