2019-09-20 Steering Committee Minutes for Data Helix Project

Attendees

  • Alan Froggatt

  • Andrew Carr

  • Colin Eberhardt

  • Matt Richards

  • Antony Welsh

Minutes

 

We debated the question “Do we view the Generator as a financial services specific tool, or as a more generic utility?”

  • General consensus was that this tool is potentially useful across a very large number of domains, we should serve financial services, but also as many other domains as possible

 

We debated “The Generator has at times been considered a testing tool, a data simulation tool, a load testing tool - which is it? Should it be all of these things?”

  • We discussed creation of violating data, and it was considered to be a useful feature, but the question asked was, “what is the relative cost of supporting this feature?”

    • It was noted that violating data could be described by the profile without the need for a ‘violating’ mode

  • We determined that the cost of implementing violating / testing is high

  • General consensus was that this tool should generate realistic simulation data in the short-term

 

We debated how we prioritise feature development, should we tackle FS features first?

  • We have seen a number of non-FS teams wanting to use it already

  • Non-FS domains tend to be more simple, making them a good starting point

  • There was a general feeling that relational data is a priority

    • however, there are still a number of use-cases where it is useable without relational

    • Without relational, the tool is better suited for creating CSV files, API outputs, but not the population of database tables

  • Consensus was that relational data is a v2 priority

  • We briefly discussed how this tool should be ‘packaged’

    • We have found that people were wanting to use this tool in various different contexts, streaming direct to database, used within Python, etc …

    • Our current CLI approach doesn’t close any doors yet

 

We debated “Do we need to invest more in the Generator UX?”

  • We agreed that the user experience is key and a high priority

  • There was a brief chat about the value of allowing people to share profiles or components of profiles

  • We debated whether the generator (CLI and profile) is sufficient

    • There was general consensus that this is sufficient for v1.0

  • We discussed that the DataHelix team should spend some time on the profile design, using the steering group to discuss options.

Need help? Email help@finos.org we'll get back to you.

Content on this page is licensed under the CC BY 4.0 license.
Code on this page is licensed under the Apache 2.0 license.