JP Patent App. 2026-046620 (independent product)

PSDP — Same-Language Parallelization

Inject deterministic parallelization into your existing applications — without rewriting source code.

PSDP (Phase-Synchronous Deterministic Parallelism) is JAVATEL's independent runtime-injection product line for parallelizing existing Java, C#, Rust, Go, Kotlin and 11 more — 16 languages in total, delivering 2.97–4.23× speedup with bit-exact output and a hash-chained audit trail.

Slime-Boy Slime-Boy: Wait — 4× speedup without changing my source? Really? ▾

“No source rewrites” — how is that even possible?

  • Your OrderBatch.java stays byte-for-byte identicalgit diff shows nothing.
  • At build time, PSDP reads the bytecode, finds the parallelizable regions, and injects a parallel version alongside (Java agent / .NET profiler API / Rust proc-macro — the mechanism is language-specific).
  • At runtime: 1 core → sequential path. 4 cores → parallel path. You write no dispatch code.
  • The classic “parallel changed my answer” failure mode is excluded by theory via PSDP’s proprietary mathematical synchronization mechanism (details disclosed in patent filings and published paper; redacted here).
  • Every production run, bench compares sha256 of sequential vs parallel output. Mismatch → instant rollback.

So: not one line of your code touched / just rebuild / output unchanged at the bit level / 4× on 4 cores (precisely 2.97–4.23×, depending on cores and algorithm shape). For audit, “zero code change” is the selling point.

Peak speedup
4.23 ×
Bit-exact runs
378 / 378
Fault rollbacks
90 / 90
Determinism (378/378)
100%

Try the service (PSDP) → Contact us Use cases Licensing

Supported languages

PSDP generates a sequential / parallel pair within the same language. Production-ready across 16 languages, from enterprise to systems to scientific computing.

Enterprise Java 8 / 17 / 21 · C# · Kotlin (+coroutines) · Scala
Systems Rust · Go · C / C++ · Swift
Scripting Python · Node.js · PHP
Functional / Scientific Clojure · Erlang · Common Lisp · FORTRAN

Use cases

Financial trading systems

Parallelize overnight batch and risk computation; cut window time by 3× with cent-exact bit-exact guarantee. SAFETY_MAX profile satisfies audit needs.

Regulated Java / C# parallelization

Parallelize finance and insurance core systems under J-SOX / SEC oversight, with hash-chained audit trail. Result invariance enables recompute, comparison, and audit.

Cloud cost reduction

3–4× speedup on existing workloads = 1/3 the infrastructure. Compress AWS / GCP / Azure instance count and size with no code changes.

HFT (high-frequency trading)

Low-latency order processing and market-data parallelization. GAME_MIN profile minimizes safety overhead to prioritize frame time.

Scientific computing

Bit-exact parallelization of FFT, Conv2D, LU decomposition. Eliminates the "parallel result differs, can't compare" failure mode.

ML inference acceleration

Parallelize trained-model inference pipelines with deterministic guarantees. Compatible with A/B comparison, regression validation, and security audit.

How it works — mathematical synchronization for bit-exact parallelism

The sufficient condition for parallel execution to return the same result as sequential is derived from PSDP’s proprietary mathematical synchronization mechanism over relational operators. At runtime, PSDP enforces this condition below a threshold and gives an analytic upper bound on numerical drift — eliminating the "numbers change when parallelized" phenomenon at the theory level. (Theoretical detail is disclosed in patent filings and the published paper; redacted on this page.)

// PSDP applied (OrderBatchProcessor_PSDP.java, excerpt)
// - sha256 of input/output exactly matches the original
//   sequential code (OrderBatchProcessor_ORIGINAL.java)
public class OrderBatchProcessor {
    public Result process(List<Order> orders) {
        return orders.parallelStream()        // only the parallel call differs
            .map(this::settle)
            .collect(PSDP.phaseSyncReduce(...));   // phase-synchronized reduce
    }
}

The diff between sequential and parallel is a few API calls only; no logic rewrites. Bench regression-tests sha256 equality between sequential and parallel output every CI run.

Technical specifications

  • Injection methodRuntime injection. No source code rewrites; rebuild only.
  • Bit-exact guaranteesha256 of sequential and parallel outputs match exactly. Eliminates the typical "parallel changes the numbers" failure mode.
  • Mathematical sync guaranteePSDP’s proprietary synchronization condition is enforced at runtime, yielding an analytic upper bound on numerical drift (theoretical detail redacted).
  • Hash-chained auditFNV1a64 + SHA-256 monotonic chain. 1-byte tampering is DETECTED. Append-only, no after-the-fact edits.
  • Fault-injection rollback90/90 scenarios with zero barrier violations. Tombstone-based past-state restoration.
  • 11 safety mechanismsCovered by claims 29-32, configurable across 5 profiles (GAME_MIN to SAFETY_MAX).

Verified benchmarks

CategoryKernelSpeedupNotes
ComputeNENC (numerical equivalence)3.38 ×Near-linear in cores, CPU-bound
GraphGRA (graph kernels)2.17 ×Memory-bandwidth ceiling, bit-exact preserved
Database (OLAP, partition-parallel)TPC-H Q1 / Q6 column store, 12 M rows4.38 – 5.20 ×1c → 8c on Ryzen 9 7945; 25 / 25 runs bit-exact vs reference
Database (CDC replay)commutativity-aware 20 M-op WAL replay4.26 ×vs serial baseline; 25 / 25 final-state hash matches
Database (multi-conn read)SQLite WAL parallel-reader throughput9.76 ×14.2 → 138.6 qps, 1c → 12c
SIMDsvt_av1_quantize_fp (AVX-512)3072 / 3072Bit-exact PASS, ±5% vs AVX2 (memory-bound)

378 runs all bit-exact (100% PASS) / 90/90 fault-injection rollbacks are detailed in the PSDP paper (JP v5d, 910 KB PDF, 2026-03-04). Database numbers above re-measured 2026-05-18 on AMD Ryzen 9 PRO 7945 (12c / 24t, OpenJDK 21); the earlier “DB 1.02 ×” figure was a wall-time aggregation artefact (workload-normalisation across non-PSDP phases) and is superseded by the partition / CDC / WAL-reader cases shown here. 75+ runs all deterministic.

Versus alternatives

Intel TBBParallelization framework, but requires source rewrites; no bit-exact guarantee. PSDP: no source changes + bit-exact + audit trail.
OpenMPpragma-based, requires explicit annotation; reproducibility is implementation-dependent. PSDP: runtime injection + result-invariance + audit fitness.
Java parallelStreamResult order non-deterministic; reduce on floats is non-reproducible. PSDP's phaseSyncReduce makes reduce order deterministic too.
C# Parallel.ForEachSame: side-effect order non-deterministic. PSDP provides bit-exact within-language parallelism.
GPU parallelism (CUDA / Metal)Cross-language, results are arch-dependent. PSDP stays within the same language and same binary, runs on CPU.

Where PSDP excels (memory-bound large batches & compute-intensive)

PSDP delivers 2.97-4.23x speedup on ms-class or larger compute-intensive workloads where Bernstein commutativity holds. Particularly effective domains:

ML inferencematmul / convolution dominant, batch latency hundreds of ms to seconds, PSDP 2.97-4.23x
HFT computationTick-by-tick valuation, easy Bernstein region split, low-latency parallel
Scientific computingFEM / fluid / molecular dynamics, clear domain decomposition + sync points
Large-scale batch DB walkSlimeTree-VSAM-style in-memory backends with sequential cursor (memory-BW bound) — PSDP 6-8x scaling expected

Real example: SlimeTree-VSAM 1-billion-record sequential batch (synergy case) — SlimeTree-VSAM standalone is already 480× faster than PostgreSQL in same-host bench (1B sequential = 4.4 min vs PostgreSQL 19.5 hours). Adding PSDP integration for 8-thread parallel walk, even with memory BW sharing, gives an expected 4.4 min → 30-60 sec (6-8x scaling). Combined: 1,900-3,800× faster than PostgreSQL — the only solution that fits the broken overnight-batch window in large financial mainframes.

Projection synthesized from SlimeTree-VSAM's impl_v6 micro / scale bench (Zen 4 / WSL2 same host) and PSDP's ML inference bench 2.97-4.23x scaling. SlimeTree-VSAM was announced in the 2026-05-24 news.

Workloads where PSDP gives limited benefit

The above is PSDP's domain. By contrast, the patterns below see small speedups or none. We disclose this for transparency.

I/O-boundIf the CPU is idle waiting on DB / network, PSDP cannot shorten wait time — PSDP parallelizes compute, not I/O. Right tool: async I/O or queue separation (outside PSDP scope).
HashMap / tree-walk dispatch-heavyPointer-chasing code with high branch density where Bernstein commutativity does not hold — synchronization overhead exceeds gains.
Tiny workload (~ms or less)Bernstein setup is on the ms scale; if each task is μs-class, relative overhead dominates. Batch tasks together first.
Already parallel (rayon / Tokio etc.)Exclusive with other within-language parallelism. Requires removing the existing parallel layer before PSDP injection.

Real example: SlimeTree-RLM impl_v4 (Rust) — HashMap-heavy / dispatch-bound code (Hot Shelf treap, Cold Shelf RB-tree, SCC decomposition). Per-slot operations run at μs scale, making Bernstein synchronization overhead disproportionately large. Estimated PSDP injection speedup is only 1.0-1.1x — not PSDP's domain (compute-intensive matmul-class), so PSDP application is deferred.

PSDP delivers 2.97-4.23x on algorithms where computation itself dominates time (ML inference, HFT, scientific computing). For codebases matching the 4 patterns above we recommend alternative approaches (async I/O, cache optimization, batching).

Pricing & licensing (Provisional, 2026-05-18 — example at 10,000 LOC)

PathLOCBuy-in / LOCBuy-in totalRuntime rateRuntime / yearSupport
SlimeNENC (converter)10,000US$ 9.95US$ 99,500Full
PSDP for SlimeNENC customer10,000US$ 0.00US$ 02%/yearUS$ 1,990Full
PSDP direct (via SIer / partner)10,000US$ 1.00US$ 10,00020%/yearUS$ 2,000Full (via partner)
PSDP end-user direct (30% OFF)10,000US$ 0.70US$ 7,00020%/yearUS$ 1,400FAQ only

Across the first three paths, the per-LOC annual runtime cost is designed to converge to the same band; only the one-time buy-in differs, reflecting the value of the SlimeNENC converter tool when bundled. The end-user-direct path is for self-deploying customers (FAQ-only support, no SI engagement — for hands-on help please go through a partner).

Activation methodEd25519-signed license + 3-hop air-gap activation (for financial institutions).
License serverlicense.javatel.jp (AWS Lightsail, HSTS / OCSP stapling)
PatentJP App. 2026-046620 (no contractual renegotiation between paths)

For SIer / partners: partners selling at the direct list price earn a substantial margin on the one-time buy-in, and the same margin applies recurringly to the annual runtime subscription — year after year, for as long as the customer keeps PSDP in production. Concrete tier numbers are part of the partner agreement and shared after registration. See /partners/ for partner enrolment.

Direct-customer Editions (Provisional, 2026-05-18 — subject to change)

EditionPSDP-managed LOC capOne-time buy-inAnnual runtimeNotes
Trial10,000US$ 0US$ 0 (14 days)Full features, non-commercial
Individual50,000LOC × $1.00 (cap $50)LOC × $0.20Community support, commercial use OK
Corporate1,000,000LOC × $1.00LOC × $0.20Email support, SLA 99%, audit-chain export
EnterpriseunlimitedLOC × $1.00LOC × $0.20 (cap $1,000,000/year)24h SLA 99.9%, 3-hop air-gap activation, finance/medical
OEMunlimited + relicensenegotiatednegotiatedSource-escrow option, embeddable runtime

“PSDP-managed LOC” = the line count of source files inside the namespaces / packages declared as PSDP-managed in psdp.toml, measured by cloc/tokei against the year-initial git revision. Runtime self-reports the same figure for cross-verification. The above schedule is provisional: caps, minimums, and SLA tiers may be revised before the GA pricing announcement.

Patents

  • JP Patent App. 2026-046620 (PSDP core / Phase-Synchronous Deterministic Parallelism)
  • JP Patent App. 2026-046625 (phase-sync implementation, 11 safety mechanisms, claims 29-32)
  • Additional applications in preparation (SIMD / GPU extensions)

Related products

SlimeNENC family: an independent family that bit-exact-modernizes COBOL / JCL / MUMPS legacy assets into Java, Rust, C#, Kotlin, Go. The natural pairing — SlimeNENC migrates legacy to Java, then PSDP injects parallelization on the migrated Java — requires no additional contractual negotiation on the customer side.

Try the service (PSDP) → Contact us SlimeNENC family All products