informix-db

Author	SHA1	Message	Date
Ryan Malloy	2bacbc4e53	Phase 6.a: DECIMAL/MONEY row decoding works (COUNT/SUM/AVG return Decimal)

    Before:
        cur.execute('SELECT COUNT(*) FROM systables')
        cur.fetchone()  # → (b'\xc2\x02\x00\x00\x00\x00\x00\x00\x00',) raw bytes
    After:
        cur.execute('SELECT COUNT(*) FROM systables')
        cur.fetchone()  # → (Decimal('276'),)

    The trickiest decode of the project so far. IDS DECIMAL/MONEY wire format:

        byte[0] = (sign << 7) | biased_exponent_base100
            bit 7 = sign (1=positive, 0=negative)
            bits 0-6 = (exponent + 64), XOR'd with 0x7F if negative
        byte[1..] = digit-pair bytes (each 0..99 = two BCD digits)
            if negative: asymmetric base-100 complement applied:
            walk digits right→left, trailing zeros stay zero,
            first non-zero subtracts from 100, rest from 99

    Initial naive "99 - d for all digits" decoder gave artifacts like -1234.559999 instead of -1234.56. The asymmetric complement rule (from Decimal.decComplement line 447) is what makes negatives round-trip exactly.

    Width on the wire: per-column encoded_length packed as (precision << 8) | scale; byte width = ceil(precision/2) + 1. parse_tuple_payload uses this to slice DECIMAL columns correctly.

    Tested cases all decode correctly:
        COUNT(*) → Decimal('276')
        SUM(tabid) → Decimal('55')
        AVG(tabid) → Decimal('5.5')
        1234.56::DECIMAL → Decimal('1234.56')
        -1234.56::DECIMAL → Decimal('-1234.56')
        -0.5::DECIMAL → Decimal('-0.5')
        -99.99::DECIMAL → Decimal('-99.99')
        -12345678.9::DECIMAL → Decimal('-12345678.9')
        NULL → None

    Encoder (_encode_decimal) is implemented but disabled: the server rejects the produced bytes (precision packing not quite right). Phase 6.x will fix. Workaround: cast Decimal to float, or pass via SQL literal.

    Module changes:
        src/informix_db/converters.py:
            + decimal module import
            + _decode_decimal: full BCD decoder with asymmetric complement
            + _encode_decimal (Phase 6.x stub, present but unreached)
            + DECIMAL/MONEY added to DECODERS dispatch
        src/informix_db/_resultset.py:
            + DECIMAL/MONEY width computation from encoded_length

    Tests: 40 unit + 55 integration (8 new DECIMAL) = 95 total, all green, ruff clean.	2026-05-04 11:17:59 -06:00
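The wire format described in this commit can be sketched in a few lines of Python. This is a minimal illustration, not the project's `_decode_decimal`: the function names are hypothetical, NULL handling is omitted, and it assumes the digit bytes form a base-100 fraction, so that value = 0.d1d2d3... × 100^exponent.

```python
from decimal import Decimal


def complement_base100(digits):
    """Asymmetric base-100 complement; applying it twice restores the input."""
    out = list(digits)
    seen_nonzero = False
    for i in range(len(out) - 1, -1, -1):  # walk right to left
        if seen_nonzero:
            out[i] = 99 - out[i]
        elif out[i] != 0:
            out[i] = 100 - out[i]  # first non-zero digit from the right
            seen_nonzero = True
        # trailing zeros are left untouched
    return out


def decimal_wire_width(encoded_length):
    """Byte width of a DECIMAL column: ceil(precision / 2) + 1."""
    precision = encoded_length >> 8  # encoded_length = (precision << 8) | scale
    return (precision + 1) // 2 + 1


def decode_decimal_sketch(raw):
    """Decode one non-NULL DECIMAL/MONEY value from its wire bytes."""
    head = raw[0]
    positive = bool(head & 0x80)      # bit 7: 1 = positive, 0 = negative
    biased = head & 0x7F              # bits 0-6: exponent + 64
    digits = list(raw[1:])
    if not positive:
        biased ^= 0x7F                # undo the XOR applied to negatives
        digits = complement_base100(digits)
    exponent = biased - 64
    coefficient = 0
    for d in digits:                  # each byte is one base-100 digit
        coefficient = coefficient * 100 + d
    sign = "" if positive else "-"
    # Assumption: the digits are a base-100 fraction scaled by 100**exponent.
    return Decimal(f"{sign}{coefficient}E{2 * (exponent - len(digits))}")
```

Under these assumptions, `decode_decimal_sketch(bytes([0xC2, 12, 34, 56]))` compares equal to `Decimal('1234.56')`, and the complemented, zero-padded form `bytes([0x3D, 87, 65, 44, 0, 0])` compares equal to `Decimal('-1234.56')`. Building the `Decimal` from a string avoids any rounding from the active decimal context.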
Ryan Malloy	34ad04a872	Phase 2.x: VARCHAR row decoding works - three byte-level fixes

    Three findings, each caught by a different debugging technique, documented in DECISION_LOG.md:

    1. CURNAME+NFETCH PDU: trailing reserved field is SHORT not INT. Caught by byte-diffing our 44-byte PDU against JDBC's 42-byte reference under socat. The server tolerated the longer version for INT-only SELECTs (silently consuming extra zeros) but rejected it for VARCHAR queries. Lesson: server tolerance varies by query type; always match JDBC byte-for-byte.

    2. SQ_TUPLE payload pads to even byte alignment. An 11-byte "syscolumns" VARCHAR payload had a trailing 0x00 between it and the next SQ_TUPLE tag. JDBC's IfxRowColumn.readTuple consumes this pad silently; we weren't, so any odd-length variable-width row desynced the parser.

    3. VARCHAR/NCHAR/NVCHAR in tuple data use a SINGLE-byte length prefix (max 255 chars, IDS VARCHAR's hard limit). NOT a 2-byte short as I'd initially assumed. CHAR is fixed-width per encoded_length. LVARCHAR uses a 4-byte int prefix for >255 byte values.

    Module changes:
        src/informix_db/_resultset.py: _LENGTH_PREFIXED_SHORT_TYPES set, branched VARCHAR/NCHAR/NVCHAR (1-byte prefix) vs CHAR (fixed) vs LVARCHAR (4-byte prefix); even-byte alignment pad consumed after each SQ_TUPLE payload.
        src/informix_db/cursors.py: CURNAME+NFETCH and standalone NFETCH PDUs now write_short(0) for the reserved trailing field.

    Tests: 40 unit + 18 integration (3 new VARCHAR tests) = 58 total, all green, ruff clean. New tests cover:
        - VARCHAR single-column SELECT
        - Odd-length VARCHAR row (regression for the pad-byte bug)
        - Mixed INT + VARCHAR + FLOAT three-column SELECT

    Sample output:
        SELECT FIRST 5 tabname FROM systables
            → ('systables',), ('syscolumns',), ('sysindices',), ('systabauth',), ('syscolauth',)
        SELECT FIRST 3 tabname, tabid, nrows
            → ('systables', 1, 276.0), ...

    VARCHAR was the last known gap from the Phase 2 commit. Phase 2 now reads INT, BIGINT, REAL, FLOAT, CHAR, VARCHAR end-to-end. Phase 6+ types (DATETIME, INTERVAL, DECIMAL, BLOBs) remain.	2026-05-04 07:55:13 -06:00
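Findings 2 and 3 above can be illustrated with a small parser. This is only a sketch under stated assumptions: big-endian integers, the `[short warn][int size][bytes payload]` SQ_TUPLE layout noted in the Phase 2 progress commit, ASCII text, and no NULL handling. The function names and the `kind` labels are hypothetical and are not taken from `_resultset.py`.

```python
import io
import struct


def read_tuple_payload(stream):
    """Read one SQ_TUPLE body; the SQ_TUPLE tag is assumed already consumed."""
    warn, size = struct.unpack(">hi", stream.read(6))  # [short warn][int size]
    payload = stream.read(size)
    if size % 2:
        stream.read(1)  # consume the pad byte that follows an odd-length payload
    return payload


def split_columns(payload, columns):
    """Slice a payload into text columns; columns is a list of (kind, encoded_length)."""
    values, pos = [], 0
    for kind, encoded_length in columns:
        if kind == "varchar":      # VARCHAR/NCHAR/NVCHAR: 1-byte length prefix
            n = payload[pos]
            pos += 1
        elif kind == "lvarchar":   # LVARCHAR: 4-byte int length prefix
            (n,) = struct.unpack_from(">i", payload, pos)
            pos += 4
        elif kind == "char":       # CHAR: fixed width taken from encoded_length
            n = encoded_length
        else:
            raise NotImplementedError(kind)
        values.append(payload[pos:pos + n].decode("ascii"))
        pos += n
    return values
```

Feeding it an 11-byte `b"\x0asyscolumns"` payload followed by a pad byte and a second tuple shows why skipping the pad matters: without the `stream.read(1)`, the second header would be read one byte early and the parser would desync.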
Ryan Malloy	a1bd52788d	Phase 2: SELECT works end-to-end - pure-Python Informix fully reads data

        cursor.execute("SELECT 1 FROM systables WHERE tabid = 1")
        cursor.fetchone() == (1,)

    To my knowledge, this is the first time a pure-Python implementation has read data from Informix without wrapping IBM's CSDK or JDBC.

    Three breakthroughs in this commit:

    1. Login PDU's database field is BROKEN. Passing a database name there makes the server reject subsequent SQ_DBOPEN with sqlcode -759 ("database not available"). JDBC always sends NULL in the login PDU's database slot; we now do the same. The user-supplied database opens via SQ_DBOPEN in _init_session.

    2. Post-login session init dance: SQ_PROTOCOLS (8-byte feature mask replayed verbatim from JDBC) → SQ_INFO with INFO_ENV + env vars (48-byte PDU replayed verbatim: DBTEMP=/tmp, SUBQCACHESZ=10) → SQ_DBOPEN. Without all three steps in this exact order, the server silently ignores SELECTs.

    3. SQ_DESCRIBE per-column block has 10 fields per column (not the simple "name + type" my best-effort parser assumed): fieldIndex, columnStartPos, columnType, columnExtendedId, ownerName, extendedName, reference, alignment, sourceType, encodedLength. The string table at the end is offset-indexed (fieldIndex points into it), which is how JDBC handles disambiguation.

    Cursor lifecycle implementation in cursors.py mirrors JDBC exactly:

        PREPARE+NDESCRIBE+WANTDONE → DESCRIBE+DONE+COST+EOT
        CURNAME+NFETCH(4096)       → TUPLE*+DONE+COST+EOT
        NFETCH(4096)               → DONE+COST+EOT (drain)
        CLOSE                      → EOT
        RELEASE                    → EOT

    Five round trips per SELECT, same as JDBC.

    Module changes:
        src/informix_db/connections.py: added _init_session(), _send_protocols(), _send_dbopen(), _drain_to_eot(), _raise_sq_err(); login PDU now forces database=None always; SQ_INFO PDU replayed verbatim from JDBC capture (offset-indexed env-var format too gnarly to derive in MVP).
        src/informix_db/cursors.py: full rewrite: real PDU builders for PREPARE/CURNAME+NFETCH/NFETCH/CLOSE/RELEASE; tag-dispatched response readers; cursor-name generator matching JDBC's "_ifxc" convention.
        src/informix_db/_resultset.py: proper SQ_DESCRIBE parser per JDBC's receiveDescribe (USVER mode); offset-indexed string table with name lookup by fieldIndex; ColumnInfo dataclass with raw type-code preserved for null-flag extraction.
        src/informix_db/_messages.py: added SQ_NDESCRIBE=22, SQ_WANTDONE=49.

    Test coverage: 40 unit + 15 integration tests (7 smoke + 8 new SELECT) = 55 total, all green, ruff clean. New tests cover:
        - SELECT 1 returns (1,)
        - cursor.description shape per PEP 249
        - Multi-row INT SELECT
        - Multi-column mixed types (INT + FLOAT)
        - Iterator protocol (for row in cursor)
        - fetchmany(n)
        - Re-executing on same cursor resets state
        - Two cursors on one connection (sequential)

    Known gap: VARCHAR row decoding doesn't yet handle the variable-width on-wire encoding correctly. Phase 2.x will address this; for now NotImpl errors surface raw bytes in the row tuple.	2026-05-03 15:37:10 -06:00
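The five round trips listed in this commit can be written down as data and checked against an observed response-tag sequence. The following is a hypothetical sketch for illustration only; the table name, the plain-string tags, and the `matches` helper are assumptions and do not reflect how `cursors.py` dispatches responses.

```python
# Request → expected response tags, as listed in the commit message.
# A trailing "*" means "zero or more of this tag".
SELECT_ROUND_TRIPS = [
    ("PREPARE+NDESCRIBE+WANTDONE", ["DESCRIBE", "DONE", "COST", "EOT"]),
    ("CURNAME+NFETCH(4096)", ["TUPLE*", "DONE", "COST", "EOT"]),
    ("NFETCH(4096)", ["DONE", "COST", "EOT"]),
    ("CLOSE", ["EOT"]),
    ("RELEASE", ["EOT"]),
]


def matches(expected, observed):
    """Return True if the observed tag list fits the expected pattern."""
    i = 0
    for tag in expected:
        if tag.endswith("*"):
            base = tag[:-1]
            while i < len(observed) and observed[i] == base:
                i += 1  # consume any run of the repeated tag
        else:
            if i >= len(observed) or observed[i] != tag:
                return False
            i += 1
    return i == len(observed)  # no unexpected trailing tags
```

A check like this could be handy in tests that replay a socat capture: a fetch response with any number of TUPLE tags passes, while one missing COST or EOT fails.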
Ryan Malloy	e2c48f855e	Phase 2 progress: cursor scaffolding + protocol findings (SELECT path WIP)

    Cursor class scaffolded with full PEP 249 surface:
        src/informix_db/cursors.py: Cursor with execute, fetchone, fetchmany, fetchall, description, rowcount, arraysize, close, iterator, context manager. Sends SQ_COMMAND chains for parameterless SQL (Phase 4 adds SQ_BIND/SQ_EXECUTE for params).
        src/informix_db/_resultset.py: ColumnInfo, parse_describe, parse_tuple_payload. Best-effort SQ_DESCRIBE parser; refines in Phase 2.1.
        src/informix_db/connections.py: Connection.cursor() now returns a real Cursor; new _send_pdu() lets Cursor share the connection's socket without violating encapsulation.

    Protocol findings landed in PROTOCOL_NOTES.md §6:
        §6a: SQ_PREPARE format with named tags (the "trailing 22, 49" are SQ_NDESCRIBE and SQ_WANTDONE chained into the same PDU). Confirmed against IfxSqli.sendPrepare line 1062.
        §6c: Server requires post-login init sequence (SQ_PROTOCOLS → SQ_INFO → SQ_ID(env vars) → SQ_DBOPEN) BEFORE any PREPARE works. Discovered the hard way: PREPARE without this sequence gets no response; SQ_DBOPEN without SQ_PROTOCOLS gets sqlcode=-759 ("Database not available"). The login PDU's database field is a hint, not an open.
        §6e: SQ_TUPLE corrected: [short warn][int size][bytes payload] (not [int 0][short payloadLen] as earlier draft claimed).

    Two more constants added to _messages.MessageType: SQ_NDESCRIBE = 22, SQ_WANTDONE = 49

    Tests: 40 unit + 7 integration (added 2 new: cursor() returns a Cursor, parameter binding raises NotSupportedError). All green, ruff clean. Removed obsolete "cursor() raises NotImplementedError" test.

    What works end-to-end now: connect, cursor(), close, parameter-attempt gating.
    What doesn't yet: cursor.execute("SELECT 1"): the server requires the post-login init sequence we don't yet send.

    Discovered captures (kept for next session's analysis):
        docs/CAPTURES/06-py-select1-attempt.socat.log
        docs/CAPTURES/07-py-replay-jdbc-prepare.socat.log
        docs/CAPTURES/08-py-with-dbopen.socat.log
        docs/CAPTURES/09-py-full-replay.socat.log

    Three new tasks created tracking the remaining Phase 2 blockers: post-login init sequence, proper SQ_DESCRIBE parser, SQ_ID action vocabulary helpers.	2026-05-02 21:04:30 -06:00
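The "trailing 22, 49" observation in §6a amounts to appending two message tags to the end of the PREPARE PDU. A minimal sketch of that chaining follows. Only the two constant values come from the commit; the `chain_tags` helper is hypothetical, and writing each tag as a big-endian 2-byte short is an assumption. The rest of the PREPARE body is not described in this log, so it is represented by an opaque `body` argument.

```python
import struct
from enum import IntEnum


class MessageType(IntEnum):
    # Values taken from the commit message; other members are omitted here.
    SQ_NDESCRIBE = 22
    SQ_WANTDONE = 49


def chain_tags(body, *tags):
    """Append each tag to body as a big-endian short (assumed tag width)."""
    return body + b"".join(struct.pack(">h", int(tag)) for tag in tags)
```

With these assumptions, `chain_tags(b"", MessageType.SQ_NDESCRIBE, MessageType.SQ_WANTDONE)` yields the four bytes `00 16 00 31`, which is the kind of tail one would look for when byte-diffing a capture against the JDBC reference.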

4 Commits