https://eugeneyan.com/rss 09月30日
数据科学与敏捷开发
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

近日受邀主持“数据科学与敏捷开发——可行吗?”专题讨论会,该讨论会作为GovTech开发者、程序员和技术人员的首届STACK会议的一部分举行。讨论会探讨了敏捷开发在数据科学领域的适用性,参与者包括来自Lazada、GovTech、Skyscanner和科研机构的专家。讨论内容涵盖敏捷思维在预算规划、数据分析、团队协作中的应用,以及如何通过迭代开发避免传统瀑布式方法的缺陷。专家们强调敏捷是一种文化而非严格流程,并建议技术新人通过作品集展示能力。

📊 敏捷开发的核心是迭代思维,通过小步快跑、持续反馈调整方向,适用于数据科学领域。例如,通过快速分析确定电商推荐引擎优先级,避免资源浪费。

💡 敏捷并非严格遵循某套流程,而是以人为重,灵活调整。讨论会引用敏捷宣言‘人比流程更重要’,强调团队自主性。

📈 数据科学团队可通过敏捷实践提升效率:将项目拆分为分析周期,用可衡量的成果(如假设验证、可视化)替代传统交付物,实现动态优化。

🌐 专家建议技术新人构建作品集证明能力,类比摄影师需展示作品集,强调实践案例比证书更关键。

🔄 敏捷的循环反馈机制能避免瀑布式开发中需求错配问题。以电商推荐引擎为例,敏捷可提前验证用户真实需求,防止开发成果与业务脱节。

Recently, I was invited to moderate a panel on the topic “Data Science and Agile–can or not?” It’s a Singlish way of asking if Agile can be applied in the domain of data science. The panel was held in conjunction with GovTech’s inaugural STACK conference for developers, programmers, and technologists from the private sector.

Check out the awesome venue we have

Who was in the panel?

The panel involved the following guests, from left to right in the photo below:

    Eugene Yan (that’s me as moderator): VP of Data Science at Lazada (acquired by Alibaba), currently Senior Data Scientist at uCare.ai. Steven Koh: Director of Government Digital Services at GovTech leading the Agile Consulting and Engineering team and evangelising agile development in the government. Adam Drake: Formerly Chief Data Officer at Skyscanner and Redmart, with an exemplary record in the design, development, and delivery of cost-effective, high performance tech teams and systems. Ivan Zimine: Physicist and neuroscientist who works on complex systems while applying open source and open practices.https://eugeneyan.com/assets/datascienceandagile2.jpgascienceandagile2.jpg" title="Data Science and Agile—can or not?" loading="lazy" alt="Data Science and Agile—can or not?">

    What were some of the notable questions and responses?

    I don’t do anything related to technology–why should I care about agile or scrum?

    The view from the panel was that Agile is a mindset and culture of having small iterations, continuous feedback, and course corrections and improvements. This is applicable not just in tech, but in other areas as well.

    One panelist gave a humorous (though not very accurate) example of adopting agile in budget planning, where the daily spending is adjusted based on each month’s cumulative spending, with estimates for total spending in the month. While these estimates are unlikely to be accurate down to the dollars and cents, they provide a ballpark figure for one to aim for.

    What does agile look like in the context of data science? How does the data science team fit into agile rituals? Do they follow daily stand-ups and planning?

    Some audience members had difficulty understanding how agile could be adopted in a data science team. Others are part of data science teams that tried practicing it, but with limited success.

    For the panelists (and myself), we felt that the mindset of agile could also be adopted in the context of data science, where projects are done in small iterative cycles. While the deliverables may not always be a working product, or additional features, there are measurable deliverables. This could come in the form of analyses that help understand possible causes of an issue, or the testing of multiple hypotheses to identify the key problem, or visualisations that provide better understanding of the context.

    Overall, the intent is iterative development, instead of adopting a traditional waterfall approach. In a waterfall approach, significant time would be spent developing a project plan and technical specs which are then “frozen” (i.e., minor changes are difficult, major changes are almost impossible).

    Next, the system is throughly designed based on the tech specs and developed over several months or years. Sometimes, the system delivered may be less or no longer relevant to the organization given that it was planned and designed years ago. Or worse, the project is found to be solving the wrong problem, or solving the right problem with the wrong approach, at the later stages of the waterfall cycle. Months/years of effort would have gone down the drain.

    For example, perhaps a decision was made to develop a recommendation engine for an e-commerce site. After significant planning, designing, and development, it was finally released after a year or two. However, there was no measurable lift to site metrics (e.g., conversion, revenue, daily average users).

    Eventually, a study on user purchasing behaviour and journey found that most of the sales funnel was generated by the search engine, with it accounting for > 90% of clicks and purchases. Very few users actually browsed based on the recommendation engine on the homepage, search, and product pages.

    In the example above, by adopting an agile approach, perhaps some quick analysis would have been done to determine which recommendation engine to develop first–search, product, or homepage? In this process, it would have been discovered that the potential gain from a recommendation engine would be significantly less than improving the search engine. The team can then course correct and redirect their efforts, saving precious resources and time and delivering measurable value.

    What does it mean to practice Agile? Is there such as thing as true Agile? A lot of companies claim they are Agile but in reality have projects crunched by unrealistic deadlines, unclear requirements, etc.

    Currently, it seems there are different schools of thought around the concept of agile and a variety of ways that one can be certified for agile and scrum. Nonetheless, at its core, agile is a mindset and the fundamental principles are the same.

    The Agile Manifesto was raised, specifically, the first principle–“People over processes”. Imposing one school or practice of agile over another would be in violation of this principle. If a team finds practising the core principles of agile to be helpful in being more productive, that was good enough. There is no need to nitpick on whether they are adhering strictly to detailed agile methodology or techniques.

    In case you’ve not seen it before or need a refresher, herehttps://eugeneyan.comhttps://eugeneyan.com/assets/agile-manifesto.jpg

    The Agile Manifesto: People over Process

    I’m a recent technical graduate but have no experience with engineering in production, data science, etc. What can I do to get a technical role in either fields?

    The key advice panelists had was to develop a portfolio demonstrating one’s work. This could be in the form of small apps on a cloud server, or past analysis and write-ups, or simply blog posts.

    One apt example was shared by one of the panelists: He was out shopping for wedding photographers and assessing them based on their past portfolio. How many people would hiring a wedding photographer who did not have a portfolio? Similarly, as a technical candidate, having a portfolio helps to showcase your past work, giving potential hiring managers more confidence that you can deliver in the role.

    Overall, it was an enjoyable discussion with the expert practitioners in the panel. I hoped the audience learnt and benefited a lot from their sharing–I certainty did!

    As I moderate more and more panels, I find myself enjoying the discussion to a greater extent. In some of my past panels, I was nervous about keeping the conversation going and ensuring it was useful for the audience. Recently, the conversation is more casual, and I’m even able to joke around with the panelists. There were also natural follow-up questions that unearthed valuable experiences and anecdotes that the audience could take away. Looking forward to my next one!

    If you found this useful, please cite this write-up as:

    Yan, Ziyou. (Oct 2018). GovTech Conference - Data Science and Agile—Can or Not?. eugeneyan.com. https://eugeneyan.com/speaking/data-science-and-agile-can-or-not-talk/.

    or

    @article{yan2018govtech,  title   = {GovTech Conference - Data Science and Agile—Can or Not?},  author  = {Yan, Ziyou},  journal = {eugeneyan.com},  year    = {2018},  month   = {Oct},  url     = {https://eugeneyan.com/speaking/data-science-and-agile-can-or-not-talk/}}
    Share on:

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

数据科学 敏捷开发 敏捷思维 技术面试 GovTech
相关文章