MODIFICATION
D -- Data Services Modernization
- Notice Date
- 1/4/2016
- Notice Type
- Modification/Amendment
- NAICS
- 541519 - Other Computer Related Services
- Contracting Office
- Department of Labor, Office of the Assistant Secretary for Administration and Management, Office of Procurement Services, 200 Constitution Avenue, NW, S-4307, Washington, District of Columbia, 20210-0001, United States
- ZIP Code
- 20210-0001
- Solicitation Number
- 16ETAOITCNTR0014
- Archive Date
- 5/27/2016
- Point of Contact
- Daniel J. Rosenstengel, Phone: 202-693-7167; Rachel E. Johnson, Phone: 202-693-7969
- E-Mail Address
- Rosenstengel.Danie@dol.gov, Johnson.Rachel.E@dol.gov
- Small Business Set-Aside
- N/A
- Description
- RFI for ETA Data Services Modernization
SYNOPSIS: The United States Department of Labor's (DOL) Employment and Training Administration (ETA) is conducting market research to determine the feasibility of a potential procurement requirement. This is NOT a solicitation for proposals, proposal abstracts, or quotations. The purpose of this notice is to obtain information regarding the availability and capability of all qualified sources to provide DOL with Data Services Capabilities hosted within a high-availability, FISMA Moderate compliant Cloud environment. The North American Industry Classification System (NAICS) code for this request is 541519 - Other Computer Related Services.
OBJECTIVE: The ETA Office of Information Services and Technology (OIST) is seeking information on Data Services Capabilities (Online Analytical Processing Database and Visualization Tools) software, supporting software, platform, and infrastructure services (SaaS, PaaS, IaaS), and the labor needed to support the capability. This Data Services solution will foster sharing of data across program offices and departments, allowing end users to analyze data, identify trends, and respond to ad hoc data requests from senior management and congressional inquiries. The following outcomes are expected:
• Move ETA from a low-function (Read Only Reports - all Access Reports) to a high-function Business Intelligence capability during the next 12 months by leveraging technology investments to provide more information access and data sharing distributed across organizations.
• Dramatically increase end-user knowledge of enterprise data at DOL through communities of interest and data governance that identify data needs, data security policies, and common record types (e.g., grants performance).
• Move from a fixed-scalability technical approach to a dynamically scalable approach that allows ETA to rapidly onboard new data sources to meet strategic business needs while providing reporting to an unlimited number of users.
ANTICIPATED SCOPE: ETA is seeking a comprehensive Data Services solution that satisfies the minimum capabilities specified below. At a minimum, the product(s) must be Cloud deployable, meet DOL Security Standards (FISMA Moderate, FedRAMP), and provide user access configuration and management based on role and group. The Data Services solution must consist of a fully capable suite of data warehousing capabilities and reporting tools, including the extraction, transformation, load, storage, analysis, and visualization of all enterprise data for business intelligence and other reporting uses. Data mart and report development and maintenance must be available for integration with all ETA technology products and organizations within the DOL enterprise. Data elements will contain Personally Identifiable Information (PII) and other sensitive information, which must be adequately protected and secured.
ETA is seeking Data Services solutions providing the following capabilities:
• Data Management
o Consumption and publishing of data from various sources and formats, e.g., relational databases (Oracle, MS SQL, Sybase), Web Services (SOAP, REST), Web APIs, unstructured data stores, social data, etc.
o Data-level security: any capabilities that ensure that a user can query, analyze, edit, or reclassify (in whole or in aggregate) only the data on which they have adequate permissions to perform such actions
o Data lifecycle management: capabilities or expertise with managing data from requirements to disposition and archiving, mapping ETA data to relevant Federal data standards and establishing new standards when none exist, and mapping existing to new data standards as they develop
• Data Analysis
o Inspecting and cleaning data to ensure that the data in the system is of the quality needed for accurate reporting and analysis
o Transforming data from one format to another, determining the most efficient approach to translation between unstructured, structured, and formal definitions and versions
o Modeling data structure and usage in ways that enable ETA to extract greater value from its existing data and plan for future data needs
o Creation and management of rules that can be applied to the data and reused across data sets to facilitate data transformation, error and consistency checking, reporting, and combining/mixing of data
• Business Intelligence and Visualization
o Self-service business intelligence and visual data discovery
o Analytics capabilities that include statistical analysis, data mining, and ad hoc data categorization and sub-categorization
o Standard and self-service reporting
o Access via mobile platforms such as smartphones and tablets
o Dashboard capable of displaying high-level Key Performance Indicators, predefined reports, and real-time metrics based on any data source available to the solution
o Collaborative capabilities that allow users to share, comment on, edit, and mix reports created by others
• Services Management
o Agile Data Warehouse management: capability to rapidly analyze new data and requirements and integrate them into the platform, ready for analysis and reporting
o Continuous Improvement: how software components of the solution are kept up to date with the latest version without requiring ETA to redo or update utilized functionality, e.g., reports, dashboards, ETL processes, data definitions, security settings, etc.
Capabilities Focus:
• Managing and storing large volumes of data effectively
• Partitioning for manageability
• Storing data efficiently with compression
• Loading and transforming data efficiently
• Loading in batch and near real-time
• Using set-based and row-based processing
• Improving response times and scaling with parallelism
• Using a consistent and flexible approach with external tables
• Optimizing query performance
• Improving response times and scaling with parallelism
• Features for optimizing the physical data model
• Managing and allocating resources with Database Resource Management
• Monitoring and managing the database using graphical user interfaces
• Managing optimizer statistics for optimum query performance
• Complex queries that access large amounts of data
• Loading and manipulating large volumes of data
• Building indexes on large tables
• Gathering optimizer statistics
• Backing up and restoring databases
Data Services Modernization and Feature Requirements
ETA's Information Management Reference Architecture, including the Data Management Layers and the progression of data from the Raw Data Reservoir to Foundation and then on to Access, for increased data quality and enrichment, increased formalization of definition, increased "ease of use", a simplified data model, and reduced cost of query concurrency.
ETA's Data Ingestion, responsible for moving, cleaning, and transforming data through near real-time streaming, Extract, Transform and Load (ETL), and Extract, Load and Transform (ELT).
The Information Interpretation component, which presents information to ETA's external systems and services. It may interface with any of the data management layers.
The Information Interpretation component will be implemented using a variety of Analytical, Data Science, and BI tools.
High-Level Enterprise Data Services Technical Requirements use the components in the Information Management Reference Architecture as a foundation and include a scalable and balanced hardware platform.
An ETA-wide capability to store and manage large volumes of data cost-effectively while meeting reliability, availability, and performance service levels.
Functional capabilities for efficient data load and transformation in batch and real-time scenarios, as well as application-transparent optimizations for physical data models that enable ETA systems to support high levels of concurrency and low query response times, even in ad hoc query environments.
Tools to monitor, manage, and optimize the environment as a whole, and an integrated analytics and feature-rich reporting capability for high-performance Information Interpretation.
Data Modeling, including Conceptual, Logical, Physical, and Derived Models, to answer the business questions and capture the logical relationships between different parts of the information. DOL's governance and data integrity requirements may demand the use of certain constraints, primary keys, and foreign keys, and a comparison of the logical design with the various physical attributes of the enterprise database.
Hardware architecture: building a balanced infrastructure that is both reliable and easy to scale out quickly requires considerable expertise. The platform must not only deliver reliability and high performance; scalability must also be fundamental to its design. It should also provide a high-end design for extreme IO performance, that is, the ability to read and write data at very high rates. Absolute application transparency with enhanced scalability and performance is fundamental to the modern ETA Data Services design.
Servers should enable scans and other resource-consuming features to be offloaded from the database servers, and multiple nodes should be connectable using a unified InfiniBand fabric. The new Data-as-a-Service solution requires high-performance interconnects between the database servers and the storage infrastructure, because queries and transformations are likely to read and write large volumes of data. A high-performance data channel or interconnect is also required between database servers to enable parallel processing to scale across a cluster of machines.
An industry-level enterprise storage architecture that can distribute database IO across all available storage devices and scale up (or down) seamlessly without outage. Assurance of efficiency and high performance for large table scans and other IO-intensive operations. All components in the storage infrastructure must be optimized to sustain high levels of IO throughput.
To manage high data volumes ETA-wide, larger tables must be partitioned for optimal and scalable data management. Similarly, partitions can be subdivided into smaller sub-partitions to improve query join and scan performance. In addition, data compression functionality is needed to reduce the amount of storage space required for ETA's enterprise consolidated and relational environment. Different partitions can be subject to different compression choices, including basic compression, OLTP compression (a component of the Advanced Compression option), and Hybrid Columnar Compression techniques.
Similarly, the Data Services modernization may require the implementation of Raw Data Reservoirs to ingest data from applications and deliver data from a staging area for high-performance systems. They should be designed and tested carefully to ensure that they are capable of delivering data at the required rate. Network storage devices are likely to require multiple network paths to the database servers.
The design can leverage the Hadoop Distributed File System (HDFS) for storing and processing very large volumes of data. Moreover, the new data services design should be able to use external tables and provide a consistent, optimized, and flexible approach to batch and micro-batch loading from raw data reservoirs implemented using "conventional" file systems. External tables can be queried in the normal manner using standard SELECT statements.
The Master Data Management (MDM) based Data Services design should have a Batch Loading services capability for high-performance data loads. Direct path loading can be used in combination with external tables.
Functionality for data replication and streaming to capture Data Manipulation Language (DML) and Data Definition Language (DDL) changes made to database objects and replicate those changes to one or more other databases. The ETA data streaming capability will require capture processes or synchronous captures that capture changes made to source database objects and format them into logical change records (LCRs), which can be propagated to destination databases and then applied by the data streaming apply processes.
To handle ETA data coming from multiple programs and concurrent quarterly loads, parallelism for query performance and parallel execution is a key requirement for using all available hardware resources effectively: multiple CPUs, multiple IO channels, multiple storage arrays and disk drives, and large volumes of memory. Auto Degree of Parallelism should be used to invoke parallelism when appropriate to meet the SLAs in the Cloud setup.
Materialized views and a data reusability approach for expensive, repetitive queries, to optimize runtime and OIST resource consumption.
Queries will usually analyze a subset or aggregation of the detailed data, so a mechanism to pre-summarize and pre-aggregate data for direct use by queries offers the potential to improve query performance significantly for canned BI reports and ETA-wide ad hoc reporting.
The next-generation System Management capabilities should include data workload management, including important components such as Real Application Clusters (RAC), Database Resource Manager, IO Resource Manager, and Enterprise Manager Cloud Control (EMCC), as well as enterprise data monitoring and optimizer statistics management.
CURRENT ENVIRONMENT: The Data Services solution must be capable of interfacing with the existing environment in order to eventually displace this functionality. The current Data Warehouse development, test, and production environments are built upon the following set of tools:
Environment #1
Data Tier: Oracle 10g/Oracle 11g
OS: Oracle Solaris
Space: 7 TB
Growth: approximately 5 GB per month
Environment #2
Data Tier: Sybase IQ/MySQL
OS: MS Windows
Space: 2.4 TB
Growth: approximately 5 GB per month
CONSTRAINTS, ASSUMPTIONS AND CONSIDERATIONS: All responses should include recommendations for optimal bandwidth, data storage, architectural components, and any additional requirements to ensure that SaaS, PaaS, and IaaS capabilities are maximized. ETA also desires to understand best practices for managing the onboarding of multiple data sources, along with recommendations on how to manage the loading of data into the warehouse, including but not limited to architectural designs that provide maximum flexibility to consume large volumes of data without limiting users' access to the data through the reporting tools (e.g., proposed methodologies, schedules, batch windows, etc.).
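For respondents sizing data storage recommendations, the current-environment figures above can be turned into a rough capacity projection. The following Python sketch is illustrative only: it assumes strictly linear growth at the stated 5 GB per month, decimal units (1 TB = 1000 GB, which the notice does not specify), and hypothetical names (`Environment`, `projected_tb`) that do not come from the notice. It does not account for newly onboarded data sources.

```python
from dataclasses import dataclass

# Assumption: decimal units. The notice does not say whether TB means 1000 or 1024 GB.
GB_PER_TB = 1000


@dataclass(frozen=True)
class Environment:
    name: str
    current_gb: int           # current footprint, converted from the notice's TB figure
    growth_gb_per_month: int  # growth rate as stated in the notice


def projected_tb(env: Environment, months: int) -> float:
    """Linear projection of the storage footprint after `months` months, in TB."""
    if months < 0:
        raise ValueError("months must be non-negative")
    return (env.current_gb + env.growth_gb_per_month * months) / GB_PER_TB


# Figures taken from the CURRENT ENVIRONMENT section of the notice.
ENVIRONMENTS = [
    Environment("Environment #1 (Oracle)", 7 * GB_PER_TB, 5),
    Environment("Environment #2 (Sybase IQ/MySQL)", 2400, 5),  # 2.4 TB
]

if __name__ == "__main__":
    # Horizons are illustrative; only the 12-month window is mentioned in the notice.
    for env in ENVIRONMENTS:
        for months in (12, 36, 60):
            print(f"{env.name}: {projected_tb(env, months):.2f} TB after {months} months")
```

Under these assumptions, the stated organic growth adds only 60 GB per environment per year, so a sizing recommendation would likely be driven more by the volume of new data sources to be onboarded than by growth of the existing warehouses.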
WHO MAY RESPOND: All capable businesses are invited to participate; however, please note that responses to a possible follow-on Request for Proposal (RFP) will be limited to 8(a) small disadvantaged businesses. In your response to the RFI, please state whether you are a small business; HUBZone small business; service-disabled veteran-owned small business; 8(a) small business; women-owned small business; or small disadvantaged business. The North American Industry Classification System (NAICS) code for this request is 541519 - Other Computer Related Services. The small business size standard is $25 million.
OPPORTUNITY TO DEMONSTRATE CAPABILITIES: All 8(a) vendors who submit responses will be contacted with the opportunity to conduct a demonstration one hour in length, composed of 40 minutes of demonstration and 20 minutes of Q&A. All demonstrations will be held at the U.S. Department of Labor at 200 Constitution Ave, Washington, D.C. 20210. The government shall provide one external connection that allows access to the internet.
INSTRUCTIONS: In accordance with FAR 4.1102, all prospective contractors shall be registered in the System for Award Management (SAM) database prior to award or agreement. The website for registration is www.sam.gov. All interested parties are required to submit a capability statement package that shall not exceed 15 pages, including a cover letter that must cite the following information:
1. Response to RFI No. DOL.... ;
2. Vendor's Company Name, Address, Contact Person Information;
3. Vendor's DUNS Number; and
4. Business Size Standard/Classification.
*** No questions will be answered at this time ***
All contractor capability packages and/or responses to this RFI are due by January 11, 2016 @ 1:00 PM, EST to ROSENSTENGEL.DANIE@DOL.GOV
- Web Link
- FBO.gov Permalink: https://www.fbo.gov/spg/DOL/OASAM/WashingtonDC/16ETAOITCNTR0014/listing.html
- Place of Performance
- Address: 200 Constitution Avenue, N.W., Washington, District of Columbia, 20210, United States
- Zip Code: 20210
- Record
- SN03982031-W 20160106/160104234059-ac6f3cbc9fda0914d863f7ac45ca8a2d (fbodaily.com)