Releases: datenoio/iterabledata

v1.0.8

05 Jan 12:14

Release v1.0.8 (2026-01-05)

Added

  • AI Agent Integration Guides:
    • AGENTS.md - Comprehensive guide for integrating IterableData with LangChain, CrewAI, and AutoGen agents
    • GEMINI.md - Complete guide for using IterableData with Google Gemini AI for data processing and analysis
  • Documentation Enhancements:
    • Added docs/docs/api/capabilities.md - Capability matrix showing read/write/bulk/totals/streaming support by format
    • Updated Docusaurus configuration and sidebars
  • Development Tools:
    • dev/benchmarks/bench_import_open.py - Benchmarking tool for import performance
    • dev/scripts/inspect_zip.py - Utility for inspecting ZIP file contents
    • dev/scripts/verify_output.py - Output verification script
    • Moved find_missing_fixtures.py to dev/scripts/ directory
  • Examples:
    • examples/zipxml/ - New example demonstrating ZIP XML processing with README
    • Updated existing examples with improvements
  • Test Data:
    • Added testdata/test_zipxml.zip - Test fixture for ZIP XML processing
    • Added tests/test_property_roundtrip.py - New test for property roundtrip functionality
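
The contents of tests/test_property_roundtrip.py are not shown in these notes. As a rough illustration of what a roundtrip property checks, the sketch below writes records to a gzip-compressed JSON Lines file and reads them back using only the Python standard library. The helper names (write_jsonl_gz, read_jsonl_gz, roundtrip) are hypothetical and are not part of the IterableData API.

```python
import gzip
import json
import os
import tempfile


def write_jsonl_gz(path, records):
    """Write records as gzip-compressed JSON Lines (hypothetical helper)."""
    with gzip.open(path, "wt", encoding="utf-8") as fh:
        for record in records:
            fh.write(json.dumps(record, ensure_ascii=False) + "\n")


def read_jsonl_gz(path):
    """Read records back from a gzip-compressed JSON Lines file."""
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]


def roundtrip(records):
    """Write records to a temporary file and return what is read back."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "sample.jsonl.gz")
        write_jsonl_gz(path, records)
        return read_jsonl_gz(path)


# The roundtrip property: reading back what was written yields the same records.
sample = [{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta", "tags": ["x", "y"]}]
assert roundtrip(sample) == sample
```

A property-based test would typically run the same assertion over many generated inputs rather than a single hand-written sample.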

Improved

  • Format Detection: Enhanced iterable/helpers/detect.py with improved detection logic and better error handling
  • Compression Codecs: Updated all codec implementations (brotli, bz2, gzip, lz4, lzma, lzo, raw, snappy, szip, zip, zstd) with consistent patterns and improved error handling
  • Data Type Handlers: Refactored all datatype modules for better consistency, error handling, and code organization
  • Conversion Core: Improved iterable/convert/core.py with better format handling
  • Pipeline Processing: Enhanced iterable/pipeline/core.py with improved state management and error handling
  • Helper Utilities: Updated iterable/helpers/utils.py and iterable/helpers/schema.py with new functionality
  • Base Classes: Improved iterable/base.py with better abstraction and error handling
  • Test Suite: Comprehensive updates to all test files with improved fixtures and test coverage
  • Test Data: Updated compression test fixtures (br, bz2, gz, lz4, xz, zst) with corrected data
  • Documentation: Updated installation instructions and GitHub Pages setup documentation
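
The notes above do not describe the detection logic in iterable/helpers/detect.py. The sketch below only illustrates the general idea of detecting a data format and a compression codec from a file name such as data.csv.gz. The function name detect_by_name and both lookup tables are assumptions made for this example; the codec extensions are taken from the fixture list above, and the library's real tables and rules may differ.

```python
import os

# Hypothetical lookup tables for this sketch; the library's real tables may differ.
CODEC_EXTENSIONS = {
    "br": "brotli",
    "bz2": "bz2",
    "gz": "gzip",
    "lz4": "lz4",
    "xz": "lzma",
    "zst": "zstd",
}
FORMAT_EXTENSIONS = {"csv", "json", "jsonl", "xml"}


def detect_by_name(filename):
    """Return (data_format, codec) guessed from the file name alone."""
    parts = os.path.basename(filename).lower().split(".")
    codec = None
    # A trailing codec extension (for example ".gz") is stripped first.
    if len(parts) > 1 and parts[-1] in CODEC_EXTENSIONS:
        codec = CODEC_EXTENSIONS[parts.pop()]
    # What remains must end in a known data format extension.
    if len(parts) > 1 and parts[-1] in FORMAT_EXTENSIONS:
        return parts[-1], codec
    # Fail with a message that names the offending input.
    raise ValueError(f"cannot detect data format from file name: {filename!r}")


assert detect_by_name("data.csv.gz") == ("csv", "gzip")
assert detect_by_name("records.jsonl") == ("jsonl", None)
```

Raising an error that names the input, instead of returning None silently, is one way to provide the kind of error handling the entry mentions.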

Fixed

  • Removed obsolete test data files (test_convert_csv_json.json, test_mysqldump_*.sql, test_warc_roundtrip.warc)
  • Fixed compression codec implementations for better consistency
  • Improved error messages and handling across all modules

Release 1.0.7

15 Dec 10:29

[1.0.7] - 2025-12-15

Added

  • Performance Optimization Analysis: Comprehensive performance analysis document identifying critical bottlenecks and optimization opportunities
  • Development Documentation: Added performance optimization guide in dev/docs/PERFORMANCE_OPTIMIZATIONS.md

Improved

  • Documentation: Enhanced project documentation structure with development guides

Version 1.0.6

01 Nov 08:48

Release Notes - Version 1.0.6

Release Date: December 2024

Overview

Version 1.0.6 focuses on improving documentation, developer experience, and release automation. This release enhances the project's usability with comprehensive documentation and automated release workflows.

What's New

📚 Enhanced Documentation

  • Comprehensive README: Complete rewrite with detailed usage examples covering all major features
  • API Reference: Added comprehensive API documentation for all public functions and classes
  • Usage Examples: Expanded examples section with real-world use cases:
    • Format conversion workflows
    • Pipeline processing
    • DuckDB engine integration
    • Bulk operations
    • Excel and XML processing
  • Quick Start Guide: Added quick start section for immediate productivity

🤖 GitHub Actions Release Automation

  • Automatic Release Generation: New GitHub Actions workflow that:
    • Automatically triggers on version tags (e.g., v1.0.6)
    • Verifies version consistency between tag and package
    • Runs full test suite before release
    • Builds source distribution and wheel packages
    • Creates GitHub releases with proper formatting
    • Supports optional PyPI publishing
  • CI/CD Integration: Streamlined release process with automated validation
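
The workflow file itself is not reproduced in these notes. As a minimal sketch of the version consistency step, the function below compares a tag such as v1.0.6 with a package version string. The name tag_matches_version and the accepted tag pattern are assumptions for illustration, not the project's actual check.

```python
import re


def tag_matches_version(tag, package_version):
    """Return True when a tag like 'v1.0.6' names the given package version."""
    # Accept an optional leading "v" followed by dot-separated numbers.
    match = re.fullmatch(r"v?(\d+(?:\.\d+)*)", tag.strip())
    if match is None:
        raise ValueError(f"unrecognized version tag: {tag!r}")
    return match.group(1) == package_version.strip()


assert tag_matches_version("v1.0.6", "1.0.6")
assert not tag_matches_version("v1.0.6", "1.0.5")
```

In a CI job, a check like this could read the tag name from the environment and exit with a non-zero status on a mismatch, so that the release stops before anything is built or published.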

📖 Changelog and Release Tracking

  • Added CHANGELOG.md following Keep a Changelog format
  • Release notes tracking for future versions
  • Better version history visibility

Improvements

Documentation

  • Better organized README with clear sections
  • More practical, copy-paste ready examples
  • Improved formatting and readability
  • Added missing API documentation

Developer Experience

  • Automated release process reduces manual steps
  • Clear version management workflow
  • Better onboarding for new contributors

Installation

pip install iterabledata==1.0.6

Or upgrade from a previous version:

pip install --upgrade iterabledata

Migration Guide

No breaking changes in this release. All existing code should work without modification.

Full Changelog

See CHANGELOG.md for detailed changes.

Contributing

We welcome contributions! Please see the Contributing section in the README for guidelines.

Feedback

If you encounter any issues or have suggestions, please open an issue on GitHub.

Acknowledgments

Thank you to all contributors and users of iterabledata!

First release

14 Jun 07:30
fdb6109

Basic support for most iterable file formats and compression codecs