The continuous development journey of a platform designed to enhance data management experiences, making them more powerful and efficient.
Throughout the development of Blendata Enterprise, we have consistently advanced the data management experience in every dimension—from supporting data connectivity across multiple sources, enabling automated data management, and enhancing processing performance for greater speed and accuracy, to designing a system that truly meets users’ real-world needs.
Every new feature introduced is more than just a technical update. It represents the outcome of learning from real-world usage by customers and partners, reflecting Blendata’s core philosophy: “Simplify Big Data Platform.” Our goal is to build a platform that is comprehensive yet uncomplicated, enabling every organization to access the power of data and AI with greater ease than ever before.
The following are the key milestones in the evolution of Blendata Enterprise—each version demonstrating our ongoing commitment to advancing technology.
Development Details of Each Version
Blendata Enterprise has continuously evolved across every version to enhance user experience in terms of performance, flexibility, and system security. This article outlines major updates from version 4.4.0 to 4.6.11, highlighting the critical changes that have propelled the platform forward.
Version 4.4.0 – Introducing Automation Features
This version marked a major shift with the launch of Workflow Management, allowing users to design, sequence, and control processes automatically, along with key capabilities such as:
- Create workflows with an intuitive Diagram View
- Define Success/Fail conditions for each task
- Receive In-app and email notifications
- Additional features included a full CRUD Function (Create, Read, Update, Delete) in Data Exploration, enhancements to the SQL Editor Workspace, and Kafka Connector management directly via the web interface.
Version 4.5.0 – Enhancing Workflow and Data Tracking Capabilities
Introduced new tools for improved visibility and control over data processes:
- Data Lineage: Displays end-to-end data flow for transparency and traceability
- Advanced Workflow Mode: Supports Python scripting within workflows
- New License Management: Supports both online and offline modes with real-time CPU usage monitoring
- Additional enhancements included Table Size Statistics, Delta Compact & Vacuum, and Spark Meta-store synchronization.
Version 4.5.1 – Upgraded Workflow and Automated Data Integration
Focused on improving Workflow and Notebook performance:
- Run Parallel Workflow: Execute multiple notebooks simultaneously for faster processing
- REST API Importing: Import data through APIs with dynamic parameters (GET, POST, PUT supported)
- Encrypted User SDK: Enables SDK execution through Workflow Management and Run Scheduler within Notebooks
- Supports importing data from Notebooks to Data Catalog via SDKs for Spark and Python
Version 4.5.2 – System Performance Optimization
Improved internal caching mechanisms to reduce requests, increasing overall system speed and stability.
Version 4.6.0 – Revamped Authentication and Enhanced Data Management
This version marks another major milestone for Blendata Enterprise, expanding its integration and data management capabilities even further.
- Third-Party Authentication: Supports Microsoft AD FS and Azure for flexible enterprise connectivity
- Personal Access Token: Users can generate and manage personal tokens, set expiration, enable/disable, or delete them instantly
- Create Table Schema for Delta Table: Add schema creation for Delta assets with integrated compaction and vacuum options
- Import & Aggregate to Existing Table: Direct import to existing tables (One-time, Schedule, or Stream)
- Databricks Ingestion: Connect and extract data from Databricks for analysis in Blendata
- Compute Task Monitoring: Track and manage Zeus/Spark jobs directly via the UI
- Data Lineage for Workflow: Visualize workflow data flow and relationships
- Service Monitoring: Display overall system services with real-time job graphs
- BDE Assistant (SQL Generator): Converts human language into SQL commands
- Auto Migration: Automates database upgrades, simplifying installation and environment setup
Version 4.6.1 – Flexible Workflow without Scheduling
Enhanced Workflow Management flexibility:
- Support for running workflows without a defined schedule
- New service type to trigger subsequent workflows directly
Version 4.6.2 – Improved Notebook Tracking and Synchronization
Increased transparency and continuity in Notebook operations:
- Record user data each time a Notebook runs
- Sync Notebook Profiles between Zeppelin and Git in both directions, with a “Hard Pull” feature to resolve file conflicts
Version 4.6.3 – Enhanced SQL Operations and Notebook Management
- Introduced a new menu, Manage Extensions Apps, supporting future extension installations.
Version 4.6.4 – Improved Git Integration and Notebook System
- Supports Git branch switching via GUI with permission management.
Version 4.6.5 – Enhanced User Experience and Responsiveness
- Improved UX in the main menu and notebook list arrangement.
Version 4.6.6 – Easier Git and Notebook Management
- Added Push to Git button in UI, eliminating the need to use Version Control menus.
- Supports selecting multiple notebooks simultaneously for permission management or exporting as ZIP files.
Version 4.6.7 – Enhanced SQL Authoring and External Data Access
- Supports External Table Access within Notebooks
- Allows CREATE/DROP TABLE via SQL commands for Spark Internal, Delta, and View Tables directly in Notebooks
Version 4.6.8 – Advanced Permission Control and Scoped Mode in Zeppelin
- Enhanced schema-level and individual data asset permission management
- Introduced Scoped Mode in Zeppelin for resource control
- Custom Service now supports Compute Pool assignment for workload isolation
Version 4.6.9 – Launch of Notebook Workspace with Full Git Integration
- Supports multiple workspaces in Notebook with full Git integration (Push, Sync, and Branch Management)
- Expanded CREATE/DROP TABLE via SQL to other UIs, such as Data Exploration and SQL Editor
Version 4.6.10 – More Flexible Data Structure and Workspace Management
- Supports default storage configuration per schema
- Improved Workspace Management, including Git credential updates and multi-workspace note uploads
- Added Z-ordering Optimization in Delta Compaction for faster queries
- Supports Ownership Transfer for LDAP/SSO users
Version 4.6.11 – Improved the UX of Workflow and Workspace for a more seamless and convenient experience.
.
Blendata’s Commitment to Future Development
Every version of Blendata Enterprise is shaped by real user feedback and close observation of global technology trends. Our development is not limited to solving current challenges but also anticipates future needs—covering AI integration, complex Big Data management, and the creation of a platform ready to scale with every organization over the long term.
Follow Blendata’s release notes and feature updates at: https://www.blendata.com/knowledge-sharing/release-notes/
*Disclaimer: All third-party trademarks mentioned are the property of their respective owners.