The World According to David Pejcoch
Profile
Ing. David Pejcoch, DiS. Ph.D. CAPM
David is experienced Data Management professional, Emerging Technologies enthusiast and passionate Data Scientist with more than 20 years of work experience. He loves to solve challenging tasks, delivering creative solutions helping customers to solve their business problems and reach their goals on their Digital Transformation journey. He’s known for his strong determination to deliver projects in promised deadlines and with high quality. He constantly keeps extending his knowledge via online learning.
- 20+ years of experience, 7 years in Insurance, 6 years in Banking, 3 years in e-Commerce, 4 years in Consulting, 3 years in Retail, 1 year in Transportation and Logistics
- Data Management, Project Management, Data Quality Management, Advanced Analytics, Data Science, CRM
- Academic background (10+ years of teaching / coaching / research)
David holds European passport as well as lifetime permission to stay in UK under EU Settlement Scheme and he is keen to travel.
SQL (Teradata, Oracle, Informix, MS SQL, DB2, MySQL, PostgreSQL), SAS, Gremlin, Hadoop, Python, R, Octave, Cloud Computing (AWS, GCP, Azure) and Virtualisation (Vagrant, Docker)
David's Super Power: Focus on target
20+
70+
6
30+
Experience
EMEA Ecosystem Architect @ Teradata (UK)
Feb 2020 - Present
Jan 2023 - Present: International Analytics and Architecture
- Architecture to Go-To-Market teams with respect to Financial Services and EMTG industries;
- Accountability for Analytics and Exploitation area;
- Lead for messaging on Data Centric AI, including Enterprise Feature Store;
- Solution Engineering support for large global Transportation and Logistics company;
- Focus on Enterprise Feature Store, SAS Accelerator, Vantage modernization, API integration, visualization of data in Vantage using 3rd-party tools;
- ArchiMate standardization.
Feb 2022 - Jan 2023: True Solution Engineer
- Solution Engineer for global Transportation and Logistics company;
- Migration of Near-real-time processing use case from on-prem to Vantage on Azure;
- Running internal labs for High Availability integration in Hybrid Cloud Ecosystem;
- Executive Briefings and workshops with key stakeholders;
- Positioning the Vantage CloudLake;
- TDVM (Teradata on VMWare) expansion;
- Streaming Processing with Teradata presentation;
- Implementation patterns for Transportation and Logistics use cases;
- Design of two new on-prem IntelliFlex systems and integration with existing Backup-and-Restore solution.
Oct 2021 - Feb 2022: Back to Ecosystem Architecture
- Participation on design of concepts such as Modern Analytics Architecture, Analytics 1-2-3, Feature Store and Teradata Data Mesh;
- Industry Implementation Patterns SME for Finance;
- Implementation patterns for integration with RegTech calculation engines;
- Mapping of the Financial Services Data Model to RegTech calculation engines;
- Cloud Journey (benefits, risks, migration patterns) presentation for large European banks;
- RFP for RegTech integration (large European bank);
- Ecosystem Architecture Repository management.
Sep 2020 - Oct 2021: Temporary Solution Engineer
- Solution Engineering for two big retail companies in Germany (POC and migration of NON-PROD environments to Google Cloud);
- Participation on design of concepts such as Modern Analytics Architecture and Analytics 1-2-3;
- Cloud Readiness Assessment for big German retailer and large UK bank;
- Cloud Migration SME;
- Industry Implementation Patterns SME for Finance;
- Project Management: POC + NON-PROD onboarding and migration for big German retailer
- RFP for Scandinavian tax agency (joined solution with SAS) and big UK bank (Financial Risk + Treasury);
- RFP for big UK bank (Regulatory Reporting);
- Architecture design / roadmap for centralized regulatory reporting (a response to BoE paper);
- Presentations of the Industry Logical Model for Finance;
- Performance awards: H1/2021.
Feb 2020 - Sep 2020: Ecosystem Architecture
- Participation on design of concepts such as Teradata Data Management Capabilities and Data Silos Consolidation;
- Ecosystem Architecture supervision for big UK bank (migration to Google Cloud);
- Ecosystem Architecture Roadmap supervision for 10+ EMEA Commercial and Enterprise accounts across Finance, Retail, CPG, Aviation verticals;
- Environmental impact of cloud migration study;
- Performance awards: H1/2020.
External Data Quality Management / MDM Consultant @ Malfini, a.s.
Mar 2021 - Present
Lector @ Business Insitut
Sep 2011 - Present
- MBA IT Management Programme;
- Data, Information and Knowledge Management modul;
- Data Management modul;
- Knowledge Management modul;
- Web Services modul;
Teacher @ University of Economics in Prague
Sep 2010 - Present
- M.Sc. Data Quality Managment course (2012-Present);
- Bc. Data Quality Managment course (2022);
- Processing of Information and Knowledge course (2010-2012);
- Supervision of Bachelor and Master theses;
AVP Project Management @ LISS Software (EXL)
Oct 2019 - Jan 2020
- Data Migration Lead for migration of BFS legacy system to LISSIA policy management system;
- Design of LISSIA Data Migration and Quality Assurance approach;
- Design and implementation of Python-based data validation framework;
- Design and implementation of T-SQL-based source-to-target reconciliation framework;
- POC for automated testing using Selenium.
AVP Analytics @ EXL Service (UK)
Oct 2018 - Sep 2019
- Creating proposals for future projects focused on GDPR compliance, Operating Model redesign, reporting optimisation and Risk Management Expert System design (5+);
- Responsibility for development of Unity Data Asset Management / GDPR Solution (November 2018 - Now);
- Design of detailed implementation project plan for Unity Solution;
- Unity demo sessions for clients / prospects (5+);
- Design and implementation of scorecard metrics, proving benefits from implementation of Data Lake and governed process of data delivery via CDO at Santander (February 2019 – May 2019).
Senior Consultant @ Dufrain UK
Jul 2017 - Oct 2018
- Dufrain AWS Cloud Administrator and Architect;
- Emerging Technologies Community Lead;
- London SAS Community Lead, SAS Admin;
- Managing personal development of 4 appraisees;
- SAS installation / configuration on MS Server / Red Hat;
- PoC for Sentiment / Context Analysis using KNIME, R and Python.
DWH Technical Lead @ Ultimate Finance (Apr 2018 - Oct 2018)
- Technical and Design Lead on Data Warehouse project;
- Hybrid ETL using a combination of MS SQL Server on Azure, SSIS, PowerShell, Python, T-SQL, S3 buckets and C#.
Data Analyst @ RBS (Dec 2017 - Feb 2018)
- Mortgages Pilot Project;
- PoC for a single version of truth about Mortgages data;
- Data Lineage for existing flows;
- Reconnecting existing dashboard to Teradata DWH and Hadoop sources;
- Introduction of automated metadata-driven process for data synchronisation between Teradata DWH and SAS Visual Analytics;
- Introduction of DQ Framework for automated DQ checks;
- Recommendations for future architecture changes.
Senior SAS Developer @ Sainsbury's Bank (Jul 2017 - Dec 2017)
- Remediation - R4 Loans;
- Leading a team of 10+ SAS / Teradata developers;
- Responsibility for delivery of 200 migrated reports and new builds;
- Development of complex dashboard monitoring the structure of portfolio and exceptions for Credit Risk.
Senior Consultant @ Sopra Steria
Jul 2015 - Jun 2017
- Data insight using SAS Visual Analytics and Enterprise Guide;
- Data integration using various sources (DB2, Teradata, Hadoop) and tools (SAS, Business Objects);
- Quality Assurance and Management (peer reviews, validation controls design and implementation, reconciliation);
- Data Warehouse design and implementation;
- Deep knowledge of Big Data Ecosystem (Hortonworks University) with practical implementations using Hive, Pig, Oozie, Sqoop (Ulster Bank / Royal Bank of Scotland).
SAS Analyst @ RBS / Ulster Bank (Jun 2016 - Jun 2017)
- Project Sycamore - Mortgages Remediation;
- Simulation of documents triggered by different steps within clients' journeys;
- Data Mart design and development;
- Quality Assurance - code peer review, Data Validation;
- Data Reconciliation (impact analysis of changes in remediation populations);
- Data Lineage for remediation Data Marts;
- Hub Hero Recognition for popularization of RBS Hadoop Hub (February 2017);
- Data Integration using various data sources including Teradata, SAS and Hadoop.
MS SQL Server Reporting Services Developer @ Borough of Broxbourne (May 2016)
- Analysis of current reporting based on MS SQL Server Reporting Services
- Data Lineage of current flows
- Correction of identified issues
SAS Analyst @ RBS CPB (Jan 2016 - Apr 2016)
- Implementation of data mart providing single version of truth about clients’ digital activities;
- Integration of data from various sources including Teradata, DB2, Business Objects and flat files or spreadsheets;
- Analytical insights based on implemented Data Mart;
- Mentoring team members.
SAS Analyst @ The Phoenix Group (Jun 2015 - Dec 2015)
- SAS development of additional functionality into integration middleware communicating with could-based solution running portfolio stress scenarios;
- Development of additional reporting functionality;
- Development of tools for automated testing;
- Definition of Low Level Design specifications for future changes.
Data Scientist @ eBay UK
May 2014 - Jun 2015
- Deep dive into selected areas in UK, France, Italy and Spain;
- Performance insight for weekly SCRUM meetings;
- Pricing SME for France, Italy and Spain (from 01/2015);
- Linking product categories from EU sites to identify inventory gaps (05 – 07/2014);
- Building analytical self-service tools;
- Mentoring team members;
- Big Data analytics using Hive, Pig and MapReduce;
- Data preparation and statistical modelling using SAS, R and Python;
- Advanced Teradata queries performance tuning;
- Data visualization using Tableau;
- Descriptive and predictive modelling using statistical methods and Machine Learning;
- Text Mining (keywords extraction) using Python NLTK and KNIME;
ROE FP&A Analyst @ eBay UK
Aug 2013 - May 2014
- Analytical support for projects in France, Italy and Spain;
- Performance insight for weekly SCRUM meetings;
- Evaluation of projects focused on sellers’ performance improvement;
- Complex evaluation of Retail Promos performance and co-investment effectiveness;
- Descriptive models explaining reasons of churn after first purchase;
- Optimization and automation of Retail Promos reporting (SAS, Teradata);
- Mentoring junior team members;
- Evaluation of changes using AB Testing;
- Data processing and statistical modelling using SAS and R;
- Advanced Teradata SQL.
Senior Data Analyst @ eBay Czech Republic
Nov 2012 - Jul 2013
- Analytical support for France, Italy and Spain;
- Analytical SME for Trust and Buyer Experience area;
- Data preparation and statistical modelling using SAS (Enterprise Guide, Enterprise Miner).
Project Manager @ Raiffeisenbank
Jan 2011 - Oct 2012
- Project Manager on Siebel CRM implementation (part of huge bank transformation programme);
- CRM Track Test Leader – coordination of functional and system integration tests (200+ interfaces);
- Retail Sales Management Stream Leader – requirements analysis and design of Product 365 / Lead Management process;
- Product Catalogue Stream Leader – requirements analysis and design of centralized product catalogue and its integration to satellite applications;
- Coordinator of User Acceptance Tests for Retail Banking – design and interactive testing with users.
CRM Team Leader @ Generali
Jan 2010 - Dec 2010
- Design and realisation of retention programme;
- Responsibility for customers de-duplication and households identification;
- Implementation of Customer Lifetime Value (CLV) concept and value-based segmentation;
- Implementation of behavioural segmentation (in cooperation with external agency);
- Implementation of service and communication processes segmentation (based on CLV and behavioural segments);
- Design and realization of up-sell, x-sell, retention, acquisition and event-based campaigns, delivering required volume of revenue;
- Vendor selection and implementation of CRM / Campaign Management system;
- Reporting to Board member.
Senior Analyst for Data Mining @ Kooperativa, VIG
Mar 2009 - Dec2012
- Development of pricing models using statistical methods and Machine Learning – design and implementation of individual pricing in MTPL;
- Responsibility for validation, unification and initial de-duplication of customers as a part of MDM implementation;
- Guarantor of Analytical Data Mart (areas of responsibility: Actuarial Mathematics, Analytical CRM and Advanced Analytics);
- Reporting to Board member.
Actuary @ Kooperativa, VIG
Nov 2007 - Mar 2009
- Development of expert system for pricing in MTPL retail and fleet insurance;
- Maintenance and further development of claims prediction models;
- Participation on strategic decisions in MTPL, supporting B-1 level, communication with Board members.
Programmer / Analyst @ Kooperativa, VIG
Jul 2004 - Nov 2007
- Technical Solution Architect (“The SAS Guy”) of SAS BI platform – implementation of DWH, reporting and analytics across two insurance companies within VIG holding on AIX (2006 – 2007);
- Design and implementation of Data Warehouse on MS SQL platform (2004 – 2006);
- Implementation of IBM Enterprise Replication as a part of DWH solution;
- Consolidation of reference data into central Metadata Repository;
- Design and implementation of integration middleware between Core and Document Management System (2004 – 2005);
- Analytical support for multiple projects.
Freelancer
Feb 2001 - Jul 2004
- Web design (~20 websites and applications);
- SEO (Search Engines Optimization);
- Web advertising;
- Propagation CDs, animations and graphics;
- Web applications design and development (portals, eShops).
Projects
Data Quality CZ
2011 - 2015
Legacy online portal (written in Czech) publishing articles and studies focused on Data / Information Quality Management, Master Data Management and Data Governance.
4IZ562 - Data Quality Management
2011 - Present
Supporting materials for my Ms.C. classe taught by David Pejcoch at the University of Economics in Prague.
https://github.com/dpejcoch/4IZ562
Pontus Vision GDPR
2018 - 2020
Participation on development of open source GDPR / Data Asset Management Tool governed by Pontus Vision company.
https://github.com/dpejcoch/nifi-flows
DQM Journal COM
2019 - Present
New online portal publishing articles and studies focused on Data / Information Quality Management, Master Data Management and Data Governance.
Data Governance Toolkit
2010 - Present
Lab environment for Data Quality Management tutorials and courses taught by David Pejcoch. Simulation of real scenario from Insurance vertical.
Data-Information-Knowledge MBA
2011 - Present
Supporting materials for my MBA classes taught at Business Institut in Prague.
https://github.com/dpejcoch/MBA-IT
Qualification
Badges Board
Formal Education
- Ph.D. @ University of Economics, Prague: Ph.D. studies of Informatics (Computing), 2007 - 2015
- Ing. @ University of Economics, Prague: Master degree in Statistics and Insurance Engineering / Management in Information Society, 2006
- DiS. @ Higher School of Tourism, Prague: Graduated Specialist, 2000
- CAPM @ PMI: CAPM - Certified Associate in Project Management, 07/2012, (renewed 07/2017), License #1527183
Certificates and Trainings
2023
- Data GovernanceUdemy: Data Governance Fundamentals, 01/03/2023, C-a32b7ed2-b0a9-4d98-a71e-3d9b0e6745d7
- AzureLinkedIn Learning: Exam Prep: Microsoft Azure Fundamentals (AZ-900), 03/02/2023, AVu4U2ZUCL8ou7uvaVuaXxjIVJkZ
- AzureLinkedIn Learning: Prepare for the Microsoft Azure Fundamentals (AZ-900) Certification Exam Learning Path, 03/02/2023, AQmNwYIjeljDq_2ppWlYTVEvM4Hu
- AzureLinkedIn Learning: Learning Azure Network Security 30/01/2023, Abr-Dgt7Zfb-m23JYIVVtwF7ByhO
- AzureLinkedIn Learning: Microsoft Azure Fundamentals (AZ-900) Cert Prep: 2 Azure Architecture and Security, 06/01/2023, AW6cCU0gW1MB3GqV-Rg-DUmPJLmD
- AzureLinkedIn Learning: Learning Azure Management Tools, 03/01/2023, AfOF6nHBrtK_XNo_thBkQAyq0GUh
2022
- AzureLinkedIn Learning: Azure Active Directory Basics, 02/12/2022, AY7uOBRSbVs2lypZwyixGO2YPIP5
- AzureLinkedIn Learning: Microsoft Azure Fundamentals (AZ-900) Cert Prep - 1 Cloud Concepts, 01/12/2022, AYiIPGbHnNpnIGYTA5nFIfCD6788
- VMwareLinkedIn Learning: VMware vSphere Advanced Networking, 01/12/2022, AYQdjzfNE3EZekdQrDvEqh_KJVPT
- Data QualityUdemy: Data Quality Masterclass - The Complete Course, 30/12/2022, AYQdjzfNE3EZekdQrDvEqh_KJVPT
2021
- Enterprise ArchitectureUdemy: New for 9.2! Part 2 Certified Certification Training (TOGAF), 10/29/2021, UC-495c1258-916f-4fae-9fa2-24fcbc3968a8
- Enterprise ArchitectureCoursera: Enterprise Architecture, 01/16/2021, RQHH46J58HZE
- AutomationUdemy: Raise Of The Machines - Impact Of Automation On A Human World, 09/08/2021, UC-78e7e8db-a987-4d5c-915c-f7a958c291e1
- TeradataVantage on Azure - Fundamentals, 09/04/2021
2020
- LearningCoursera: Learning How to Learn: Powerful mental tools to help master tough subjects, 08/12/2020, U646M4F85RFY
- CloudCoursera: AWS Fundamentals: Migrating to the Cloud, 24/11/2020, L4C6CNVTPUQ5
- LinuxUdemy: Linux Security and Hardening, The Practical Security Guide, 9/11/2020, UC-b2373934-b3e8-40be-affb-d1162ad0cc89
- TeradataTeradata University: Teradata Factory Xpress - Vantage Advanced SQL Engine, 21-30/10/2020
- CloudUdemy: Apache Beam | Future of Big Data, 02/10/2020, UC-edbd2931-3faf-4a59-9e43-54245f937eb8
- CloudUdemy: Apache NiFi - A Complete Guide | Cloudera DataFlow | HDF/CDF, 12/09/2020, UC-d3e68020-7ff8-4941-9aea-56445ecdd057
- CloudUdemy: GCP: Complete Google Data Engineer and Cloud Architect Guide, 06/08/2020, UC-291fd18d-86a7-4293-9b3b-703048fc6ad0
- GardeningUdemy: Growing Microgreens for Business and Pleasure, 04/06/2020, UC-15234de4-5ae2-486d-8893-dc2bf89a4c5a
- RoboticsUdemy: Robotic Process Automation - RPA Overview, 04/06/2020, UC-8c16105a-e6f2-404c-b7bc-5af0bc3e99bb
- TOGAFUdemy: New for 9.2! Part 1 Foundation Certification Training, 25/02/2020, UC-a07e39f4-6ff6-4e97-84b2-97535cfa8265
- TeradataSales Tech Cloud - EMEA Basics and Advanced, 17-21/02/2020
- GraphUdemy: OrientDB - Getting Started, 26/01/2020, UC-8NBR34IZ
- NeuroscienceCoursera: Biohacking Your Brain's Health 23/01/2020, NGEF7A6FWNNJ
- Time ManagementCoursera: Work Smarter, Not Harder: Time Management for Personal & Professional Productivity 02/01/2020, 9HZS7DAQQYPM
- GraphCoursera: Introduction to Graph Theory, 01/01/2020, 9BJMRA9B3MU5
2019
- AWSCoursera: Getting Started with AWS Machine Learning, 25/11/2019, NAWU2SXXWXW6
- Deep LearningCoursera: AI For Everyone, 21/04/2019, VBMMASBUHPV5
- IBMCoursera: Data Science Methodology, 01/02/2019, DGZ7CVVNVQSZ
2018
- IBMCoursera: What is Data Science?, 31/12/2018, ZPFFWQ33UEMB
- PythonDataquest: Python Programming - Intermediate, 12/12/2018, FZ083CPD5G6AM4ZZB3A6
- GCPUdemy: Learn GCP - Google Cloud Data Engineer Express Course!, 03/27/2018, UC-676EFDRH
- GCPCoursera: Google Cloud Platform Fundamentals: Core Infrastructure, 15/12/2018, THNSVK49K9HE
- GCPCoursera: Open Source tools for Data Science, 30/12/2018, GUVZJQ39UBMT
- PostgreSQLUdemy: PostgreSQL - From Zero to Hero, 03/21/2018, UC-JOD137E0
- RCoursera: The R Programming Environment (Johns Hopkins University), 19/03/2018, RX8ULLSVVJSN
- HadoopUdemy: Learn Big Data - The Hadoop Ecosystem Masterclass, 18/03/2018, UC-WF7M4727
- Deep LearningCoursera: Deep Learning Specialization (deeplearning.ai), 20/02/2018, LAVDJPFFHJDE
- Deep LearningCoursera: Sequence Models (deeplearning.ai), 20/02/2018, GETPYBM9PE7S
2017
- GDPRUdemy: Introduction to EU GDPR, 12/26/2017, UC-EZEEUI1T
- Big DataCoursera: Big Data Integration and Processing (UC San Diego), 12/26/2017, EMZD7V9GVHZ5
- Deep LearningCNNCoursera: Convolutional Neural Networks (deeplearning.ai), 12/03/2017, SKUB7EWDKL6M
- Big DataCoursera: Introduction to Big Data (UC SAn Diego), 11/04/2017, WU8ALPRUAJZD
- Big DataMLCoursera: Machine Learning With Big Data (UC SAn Diego), 11/02/2017, DMWN5K5JCLBU
- IoTCCoursera: The Arduino Platform and C Programming (CUI University of California, Irvine), 10/23/2017, ZN4G3TQG652G
- Deep LearningCoursera: Structuring Machine Learning Projects (deeplearning.ai), 10/20/2017, RQVZWNKY6BUG
- Six SigmaCoursera: Data Analytics for Lean Six Sigma (University of Amsterdam), 10/03/2017, 4PYVJMTZ4PNV
- Deep LearningCoursera: Improving Deep Neural Networks - Hyperparameter tuning, Regularization and Optimization (deeplearning.ai), 10/03/2017, W5WW43J67W4Z
- UnixCoursera: The Unix Workbench (Johns Hopkins University), 09/11/2017, 5RCFBACK5PZ9
- Deep LearningCoursera: Neural Networks and Deep Learning (deeplearning.ai), 09/06/2017, VE7MHYNWL6EK
- VisualizationD3.jsCoursera: Data Visualization (University of Illinois at Urbana-Champaign), 08/31/2017, FPQ2RT56D6VB
- Deep LearningTensor FlowCoursera: Serverless Machine Learning with Tensorflow on Google Cloud Platform (Google Cloud), 07/25/2017, W8BXG8D659ZU
- Project ManagementPMI: CAPM - Certified Associate in Project Management (renewal), 07/2017, License #1527183
- MarketingGoogle: The Online Marketing Fundamentals (Digital Garage), 03/13/2017
- MLCoursera: Neural Networks (University of Toronto), 01/15/2017, 72BQ4LV3K77K
2016
- RCoursera: Getting and Cleaning Data (Johns Hopkins University), 10/24/2016, DTKB69B5Q2QK
- MLCoursera: Machine Learning Foundations - A Case Study Approach (University of Washington), 05/01/2016, L2VXCN4XXB3S
- InternetCoursera: Internet History, Technology and Security (University of Michigan), 05/01/2016, FWW9Y3PG55B6
- PythonCoursera: Python Data Structures (University of Michigan), 02/28/2016, F32QJ8K6S2G4
- PythonCoursera: Using Python to Access Web Data (University of Michigan), 02/28/2016, MGUXGQGVV8QA
- PythonCoursera: Using Databases with Python (University of Michigan), 02/14/2016, GB7TLXNSGH4A
2015
- Process MiningCoursera: Process Mining (Eindhoven University), 11/2015
- Big DataCoursera: Introduction to Big Data (University of California, San Diego), 10/2015
- MLCoursera: Machine Learning (Stanford University), 10/2015
2014
- RCoursera: R Programming (Johns Hopkins Bloomberg School of Public Health), 12/2014
- Data ScienceCoursera: The Data Scientist's Toolbox (Johns Hopkins Bloomberg School of Public Health), 11/2014
- Big DataBig Data University: Big Data Fundamentals, 08/25/2014
2012
- Project ManagementPMI: CAPM - Certified Associate in Project Management, 07/2012, License #1527183
2009
- ManagementKooperativa: 2nd Level Managers' Education Programme, 09/2009
- SASSAS: PMEX - Extending SAS Enterprise Miner with User-written Nodes, 09/2009
- SASSAS: BDMCI - Advanced Analysis in CRM Using SAS Tools, 05/2009
- SASSAS: PMAD53 - Adanced Predictive Modeling in SAS Enterprise Miner 5.x, 02/2009
2008
- SASSAS: AMACR - Advanced Macro Programming in SAS, 11/2008
- SASSAS: AMUL - Multivariate Statistical Methods: Practical Research Applications, 11/2008
- SASSAS: SURV - Survival Analysis Using Model of Proporcional Hazards, 09/2008
- SASSAS: DMDP - Data Preparation for Data Mining, 09/2008
- SASSAS: AAEM53 - Data Mining with SAS Entrprise Miner 5.3, 08/2008
- SASSAS: PROG3 - Programming in SAS: Advanced Techniques, 05/2008
2007
- Enterprise ReplicationIBM: Configuration and usage of IDS Enterprise Replication, 06/2007
- SASSAS: OLAP Analysis in SAS (on demand training), 05/2007
- SASSAS: Introduction to SAS Business Intelligence (on demand training), 04/2007
- SASSAS: SAS Base, SAS DI Studio (on demand training), 01/2007
2006
- DWHAdastra: Data Warehousing - Advanced, 10/2006
- DWHAdastra: Data Warehousing - Basics, 10/2006
2004
- Informix4GLIBM: Advanced IBM Informix 4GL Development, 11/2004
Publications
Conference Proceedings
- PEJČOCH, D. TOWARDS AUTOMATIC EXTRACTION OF VALIDATION RULES FROM DATA.In Sborník příspěvků 14. mezinárodní konference IMEA 2014. 1. vyd. Liberec: Technická univerzita v Liberci – 2014, 2014. s. 225 – 228. ISBN 978-80-7494-106-1
- PEJČOCH, D. Benchmark přístupů k Fuzzy Match / Merge. In: Sborník prací účastníků vědeckého semináře doktorského studia. Fakulta informatiky a statistiky VŠE. Praha 2009. ISBN 978-80-245-1524-3.
- PEJČOCH, D. Vztah řízení dat k ostatním oblastem řízení informatiky. In: Sborník prací účastníků vědeckého semináře doktorského studia. Fakulta informatiky a statistiky VŠE. Praha 2011. ISBN 978-80-245-1761-2.
- PEJČOCH, D. Datová kvalita jako klíčový faktor úspěšnosti implementace CRM systému v pojišťovnictví. In: Sborník mezinárodní konference Evropské finanční systémy 2011. Masarykova univerzita, Ekonomicko-správní fakulta. Brno 2011. ISBN 978-80-210-5509-4.
- PEJČOCH, D. Dopad nekvalitních dat do metrik výkonnosti podniku. In: IV. Mezinárodní vědecká konference doktorandů a mladých vědeckých pracovníků. OPF SU Karviná, 2011. ISBN 978-80-7248-711-0.
- PEJČOCH, D. Aplikace pro podporu auditu datové kvality. In: Sborník prací účastníků vědeckého semináře doktorského studia. Fakulta informatiky a statistiky VŠE. Praha 2012. ISBN 978-80-245-1862-6.
- PEJČOCH, D. Audit datové kvality podle IT Assurance Guide: Using COBIT. In: IMEA 2012 Sborník příspěvků 12.ročníku doktorandské konference, Hradec Králové, CZ, UHK, 2012, s. 6, ISBN 978-80-7435-185-3
- PEJČOCH, D. Using The Data Quality Knowledge Base in IT Performance Management. In: Proceedings of the Electronic International Interdisciplinary Conference 2012, ISSN:1338-7871, ISBN 978-80-554-0551-3.
- PEJČOCH, D. Znalostní báze jako základ řešení pro řízení datové kvality. In: Znalosti 2012. Praha: Matfyzpress, 2012. ISBN 978-80-7378-220-7.
Articles in Reviewed Journals
- PEJCOCH, D. Imputation Paradox. In: Forum Statisticum Slovacum. Vol. 6/2014. ISSN 1336-7420.
- PEJCOCH, D. Critical Evaluation of Validation Rules Automated Extraction from Data. In: Journal of System Integration. Vol 5, No 4 (2014). ISSN: 1804-2724.
- PEJČOCH, D. CADAQUES: Metodika pro komplexní řízení kvality dat a informací. In: Acta Informatica Pragensia. 2014. sv. 3, č. 1, s. 44--56. ISSN 1805-4951. Indexace: Directory of Open Access Journals (DOAJ), Google Scholar, Academic Journals Database a ResearchBib Journal Database
- PEJČOCH, D. Metody řešení problematiky neúplných dat. In: Forum Statisticum Slovacum, Bratislava, ročník 7, číslo 7, 2011, s. 187 – 192, ISSN 1336-7420.
- PEJČOCH, D. Vztah řízení dat k ostatním oblastem řízení informatiky. In: Systémová integrace, 2011, roč. 18, č. 4, s. 3 - 13. ISSN: 1210-9479.
Articles in Not-reviewed Journals
- PEJČOCH D. Použití nástroje DataFlux pro řízení datové kvality. In: Computerworld [online], 27.10.2008. Dostupné pod odkazem: [URL]
Guidelines and White Papers
- JERRIM D., PEJCOCH D., SCHULZ J. Teradata & Google Cloud - Best of Breed Ecosystem Architecture, 1st Edition. Teradata White Paper, 2021.
- JERRIM D., PEJCOCH D. Teradata & Google Cloud - Best of Breed Ecosystem Architecture, 2nd Edition. Teradata White Paper, 2021.
- ANDERSON R., PEJCOCH D., FLEURY W. EFS (Enterprise Feature Store) Orange Guide, 1st Edition. Teradata Orange Guide, 2021.
Theses
- PEJČOCH, D. Kritická analýza použitelnosti metod data mining v pojišťovnictví, DP. VŠE 2006.
- PEJČOCH, D. Komplexní řízení kvality dat a informací, DDP. VŠE 2015.
Articles Published on Data Quality CZ Portal / Journal for Data Quality Managment
- PEJČOCH, D. Master Data Management. In: Data Quality CZ [online]. 2015-02-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Oprava knihovny SIMMETRICS pro porovnávání a slučování záznamů. In: Data Quality CZ [online]. 2014-07-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Big Data Quality: Practical Approach – Part 1. In: Data Quality CZZ [online]. 2014-04-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Tutorial: Instalace nástroje Talend Open Studio for MDM na virtuální stroj. In: Data Quality CZ [online]. 2014-03-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Data Governance: kam se poděla strategická úroveň řízení dat?. In: Data Quality CZ [online]. 2014-03-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Kauzality mezi vlastnostmi napříč hierarchií znalostí - 1.díl. In: Data Quality CZ [online]. 2014-01-01 00:00:00. Dostupné z: [URL]
- PEJČOCH, D. Porovnávání řetězců s využitím nástroje BASE SAS– 2. díl In: Data Quality CZ [online]. 2013-12-01 14:36:44. Dostupné z: [URL].
- PEJČOCH, D. Information Quality Assurance s využitím nástroje SAS – 1.díl In: Data Quality CZ [online]. 2013-12-01 14:36:44. Dostupné z: [URL].
- PEJČOCH, D. Dopady nekvalitních dat In: Data Quality CZ [online]. 2013-11-01 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Relevantní normy pro oblast kvality dat a informací In: Data Quality CZ [online]. 2013-11-01 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Úloha vizualizace při řízení datové kvality In: Data Quality CZ [online]. 2013-10-01 00:00:27. Dostupné z: [URL].
- PEJČOCH, D. Big Data Quality - 1. díl In: Data Quality CZ [online]. 2013-06-29 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Role validace při řízení datové kvality In: Data Quality CZ[online]. 2013-05-20 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Linked Data Quality – 2.díl. In: Data Quality CZ [online]. 2013-05-18 19:21:56. Dostupné z: [URL].
- PEJČOCH, D. Linked Data Quality – 1.díl. In: Data Quality CZ [online]. 2013-04-13 12:23:55. Dostupné z: [URL].
- PEJČOCH, D. Řízení kvality dat s využitím nástrojů firmy Talend – 1.díl. In: Data Quality CZ [online]. 2012-05-06 12:23:55. Dostupné z: [URL].
- PEJČOCH, D. Audit datové kvality podle IT Assurance Guide: Using COBIT - 1. díl. In: Data Quality CZ [online]. 2012-03-07 07:14:52. Dostupné z: [URL].
- PEJČOCH, D. Audit datové kvality podle IT Assurance Guide: Using COBIT - 2. díl. In: Data Quality CZ [online]. 2012-03-07 07:14:52. Dostupné z: [URL].
- PEJČOCH, D. Audit datové kvality podle IT Assurance Guide: Using COBIT - 3. díl. In: Data Quality CZ [online]. 2012-03-07 07:14:52. Dostupné z: [URL].
- PEJČOCH, D. Použití nástrojů DataFlux pro řízení datové kvality. In: Data Quality CZ [online]. 2012-02-05 17:47:43. Dostupné z: [URL].
- PEJČOCH, D. Řízení kvality dat s využitím nástrojů firmy Talend – 1.díl. In: Data Quality CZ [online]. 2012-05-06 12:23:55. Dostupné z: [URL].
- PEJČOCH, D. Porovnávání řetězců s využitím nástroje BASE SAS - 1. díl. In: Data Quality CZ [online]. 2012-01-03 16:23:28. Dostupné z: [URL].
- PEJČOCH, D. Deduplikace souborů ve Vašem PC – 1. díl. In: Data Quality CZ [online]. 2012-01-01 20:47:30. Dostupné z: [URL].
- PEJČOCH, D. Jaké mají být vlastnosti pracovníka v oboru řízení datové kvality?. In: Data Quality CZ [online]. 2012-01-01 20:39:06. Dostupné z: [URL].
- PEJČOCH, D. Unix shell: příručka pro DQM - 1. díl. In: Data Quality CZ [online]. 2012-01-01 20:28:37. Dostupné z: [URL].
- PEJČOCH, D. Dopad nekvalitních dat na úlohy získávání znalostí z databází. In: Data Quality CZ [online]. 2011-12-12 23:23:42. Dostupné z: [URL].
- PEJČOCH, D. The Hive Project In: Data Quality CZ [online]. 2012-06-01 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Datová kvalita jako klíčový faktor úspěšnosti implementace CRM systému v pojišťovnictví In: Data Quality CZ [online]. 2011-11-01 17:14:27. Dostupné z: [URL].
- PEJČOCH, D. Pyramida znalostí jako základ konceptuálního modelu datové kvality In: Data Quality CZ [online]. 2011-01-06 00:14:27. Dostupné z: [URL].
- PEJČOCH, D., SOBÍŠEK, L. Identifikace vztahů mezi partnery společnosti - 1. díl In: Data Quality CZZ [online]. 2011-06-19 00:14:27. Dostupné z: [URL].
Events
- Teradata EMEA Data Architect User Group (Online, 27/02/2023): Real-time Data Ingestion into Teradata (an overview and lessons learned from previous NRT implementations).
- Teradata Virtual Tech Summit 2022 (Online, 14/11-18/11): Mission Critical Ingest (Near real-time analytics using Vantage) (with Babak Timoury).
- Teradata Global Sales Experience 2021 (Online, 26/01/2021): Enterprise Feature Store (with Jean-Charles Ravon).
- AWS reInvent (Online, 12/2020): Machine Learning of the Future (with Chris Hillman).
- AWS reInvent (Online, 12/2020): Teradata Cloud Integration (with Wayne Jones).
- Teradata Feature Store Symposium (Online, 09/2020): Architecture of Feature Store
- Data a Znalosti (Praha, CR, 10/2015): Big Data Quality / Governance.
- Akademie ICT Managementu (Certified Manager of ICT – CMICT) Přednášky: Kvalita dat a její měření Procesy kontroly, čištění a optimalizace dat; Master Data Management: Procesní pohled; Master Data Management: Technologický pohled; Řešení zlepšování kvality dat.
- Školení PSSZ organizované jako součást operačního programu Lidské zdroje a zaměstnanost CZ.1.04/4.1.00/41.00002 Řízení a správa dat a datové kvality. Přednášky: Kvalita dat, Master Data Management (I), Master Data Management (II), Procesy kontroly, čištění a optimalizace dat, Řešení zlepšování řízení kvality dat.
- SAS Analytika Roadshow 2013 (Praha, ČR, 16.10.2013): Data in Online Business. Dostupné z: [PDF] | [YouTube]
- eBay Data Conference 2013 (San José, CA): Big Data Quality: Just a New Buzzword or Serious Topic? Dostupné z: [PDF] (slidy s interními informacemi eBay byly odstraněny).
- Symposium EDI 2009 (Praha, ČR, 17.4.2009): Inteligentní Business Intelligence (s Vladimírem Kyjonkou) Dostupné z: [PDF]
- SAS Forum 2008 (Bratislava, SR): Datová kvalita v praxi. Dostupné z: [PDF]
Tutorials
- PEJČOCH, D. Sylabus modulu: Data Management. [s.l.]: Business Institut,©2011.
- PEJČOCH, D. Sylabus modulu: Knowledge Management. [s.l.]: Business Institut,©2011.
- PEJČOCH, D. Sylabus modulu: Využití webových technologií v podnikové praxi. [s.l.]: Business Institut,©2012.
- PEJČOCH, D. Sylabus modulu: Řízení dat, informací a znalostí. [s.l.]: Business Institut,©2013.
- PEJČOCH, D. Talend Open Studio MDM. In: Data Quality CZ [online]. 2013-04-06 13:00:00. Dostupné z: [URL].
- PEJČOCH, D. Talend Open Studio DQ. In: Data Quality CZ [online]. 2013-05-26 15:00:00. Dostupné z: [URL]
- PEJČOCH, D. 4IZ562 Cviceni: Jak na Talend MDM - profilace. In: youtube [online]. 2014-03-21 12:30:00. Dostupné z: [YouTube]
KEG (Knowledge Engineering Group) Presentations
- Date and time: 2011-06-02 (10:30 – 12:00). Room: 403 NB Comparison of methods for imputation of missing values: David Pejčoch (Raiffeisen Bank ČR). Dostupné z: [PDF]
- Date and time: 2013-05-09 (10:30 – 12:00). Room: 336 RB Linked Data Quality: David Pejčoch (eBay + KIZI VŠE). Dostupné z: [PDF]
- Date and time: 2008-04-17 (10:30 – 12:00). Room: 403 NB Using the Fuzzy Match algorithm for data cleaning: David Pejčoch (Kooperativa pojišťovna, a.s.). Dostupné z: [PDF]
Unpublished Articles and Studies
- Referát na IZI901 (Teorie informačních a znalostních systémů): Využití Fuzzy Match algoritmu pro čištění dat [PDF]
Contact
Keen to know more about my projects? Do you want to participate? Do you need a help with your own project? Drop me an email or short message using this contact form.
david@pejcoch.com