Specializations

Data acquisition and ETL — Automated data work flow: acquisition of data from structured and unstructured sources and APIs. Development of solutions to acquire and manage data. Data movement from original sources, to data staging and cleansing repositories, to data warehouses, to data marts optimized for reporting and delivery. ETL (edit, transform, load) using scripting and open source ETL tools such as Talend. Data cleansing applications and processes. Production operations to capture data at regular intervals: once automations have been set up, we can staff inexpensively using offshore resources to monitor and sustain data flows.

Data modeling and metric definition — schema specification and creation, especially as needed to support efficient, high-performance reporting. Open-source modeling tools. Metric definition: identifying data elements and computations necessary to implement specific key performance indicators and other tracking vectors relevant to your business.

Report design, development, integration and deployment — BIRT reports, Pentaho reports. Deployment as standalone solutions or integrated with other applications, including web properties. Dashboard design. Selection and integration of appropriate charting and other visualizations.

Data analysis — making sense of data and drawing conclusions with lightweight tools, prior to creating business processes and applications. Interpretation of data.

Cloud computing — selection of cloud platforms. Provisioning of servers. Application and storage configuration. Application monitoring.