https://eugeneyan.com/rss 09月30日
数据科学家生涯分享
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文采访了Tri5 Ventures的合伙人Christopher,探讨数据科学家的生涯发展。文章涵盖了数据科学家的背景、工作内容、职业发展以及行业内的常见误区。Christopher分享了他在心理学、统计学和机器学习领域的专业背景,以及如何将这些技能应用于实际工作中。他还揭示了数据科学工作中的一些常见误区,如机器学习并非主要工作内容,以及技术能力并非唯一关键。文章还讨论了数据科学在提升业务效率、改善用户体验等方面的应用,以及数据科学家的日常工作和职业满意度。最后,Christopher为有志于数据科学领域的人士提供了职业发展建议。

🔍 数据科学家的工作内容主要包括数据理解、探索、清洗、准备和特征工程,这些工作占据了约80%的时间。机器学习仅占约20%,其余时间用于构建概念验证、验证/AB测试、开发API和部署等。

🗣️ 数据科学家不仅需要具备技术能力,还需要能够深入理解问题、设计端到端解决方案,并有效地进行沟通和说服。技术能力虽然重要,但并非唯一关键。

📊 数据科学在提升业务效率、改善用户体验等方面发挥着重要作用。例如,通过客户/卖家/产品的细分和画像,自动化和扩展流程以降低成本,以及通过推荐和排名等功能改善业务成果。

💡 数据科学家的日常工作包括编码、分析和解决问题。Christopher表示,尽管工作繁忙,但他仍然能够大部分时间进行实际操作,并对此感到满意。

🌟 Christopher建议有志于数据科学领域的人士掌握心理学、统计学和机器学习等技能,并通过在线课程和实践项目进行学习和实践。他还强调了沟通和解决问题的能力的重要性。

Recently, Christopher, Managing Partner at Tri5 Ventures, reached out for an interview about “The Life of a Data Scientist”. The intent is to share knowledge and insight with people aspiring to enter the field, or those currently practicing data science.

The article was published a week ago on Tech in Asia and can be found here: “4 Singapore-based data scientists share how data has been impacting lives”. It covers data science professionals across multiple backgrounds, including researchers, entrepreneurs, and startups.

A few people have asked if I could build on what was shared in the article, so I’m reproducing my complete responses to Christopher here.

What made you decide to pursue a career in data science?

TL; DR: My interest in understanding human perception and behaviour led me to pursue psychology, statistics, and machine learning. With these skills, I joined Lazada’s Data Science team in 2015.

Wow, I can’t remember the last time being asked this.

Since my undergrad days (when I was pursuing psychology and statistics), I’ve been interested in how people perceive, think, and behave. This required measuring perception (surveys, etc), behaviour (actions, etc), and outcomes, and making inference. In other words, data and statistics.

I then got an opportunity to apply this in IBM’s Workforce Analytics team, where we built an internal job recommendation engine for employees and tracked outcomes. The goal was to help people level up in their careers, improve job engagement and performance, and reduce attrition.

Along the way, I gained an interest and picked up machine learning. Statistics helps us to understand historical data and make inferences; Machine learning helps us learn from historical data and make predictions.

I then got an opportunity to join Lazada’s data science team in 2015, thanks to Kai Xin and John. I’m not exactly sure when my career in data science started, but I guess joining Lazada Data Science might have been it.

What are 1-2 myths of data science that needs to broken?

Myth: Machine learning is 80% of the work (is this a cliché myth?)
Truth: Machine learning is about 20%.

    About 50% is data understanding, exploration, cleaning, preparation, feature engineering. Another 30% of the time is building a POC, validation/AB testing, developing an API, deployment, etc.

Note: The above breakdown is unique to each data scientist, but it’s roughly how my time is spent.

Myth: I (only) need to be a technical rockstar.
Truth: Being a technical rockstar helps, especially with what I’ve listed above—but it’s not enough to really make an impact. You also need to be able to:

    Identify and understand problems deeply (what is the root cause? are we only fixing the symptoms?) Design end-to-end solutions (it doesn’t help if you build something awesome but people can’t use it) Communicate and convince effectively (if people don’t understand or trust it, they won’t use it)

A simple chart on how my time is spent here.

What are the achievements that data science has done? Any interesting projects that you like to share? They can be external or the projects that you previously did.

Within Lazada, data science has helped in several ways, including:

    Understanding customers/sellers/products better through segmentation, creating profiles, etc. Automating and scaling processes to reduce cost (e.g., machine learning for product categorization, entity resolution, etc) Improving business outcomes (e.g., recommendations, ranking, etc) Helping stakeholders with data (e.g., AB testing support, data deep dives, etc)

In my free time, I’ve been dabbling with using deep learning to:

The possibilities seem limitless, and there’s lots opportunity to use data to create positive impact and improve lives.

What’s life like as a data scientist? Has your perception changed in life since you always rely on data?

This is a pretty broad question.

If you’re asking about how my time is spent, here’s a chart I created a while bachttps://eugeneyan.com/assets/time-spent-as-a-data-scientist.jpg-a-data-scientist.jpg" title="Time spent as a data scientist" loading="lazy" alt="Time spent as a data scientist">

Thankfully, I still get to spend a majority of my time hands-on coding.

If you’re asking about life satisfaction, I’m pretty satisfied. My mission is to use data to create positive impact and improve lives—being a data scientist allows me to achieve this.

Has my perception changed? Yes I guess.

Sometimes, I can’t help but see things in our natural world and wonder how to measure them and use data to improve outcomes.

I’m also a bit more skeptical about claims people make, and tend to be conservative and cautious when there’s no data backing the claim.

Any advice you have to aspiring data scientists?

I’ve received this question numerous times—enough for me to document it here on my site. It covers some tools and skills, where to learn them, and how to practice them.

If you found this useful, please cite this write-up as:

Yan, Ziyou. (Jul 2017). Tech in Asia - My Journey in Data Science and Advice for others. eugeneyan.com. https://eugeneyan.com/speaking/my-sharing-with-tech-in-asia-talk/.

or

@article{yan2017asia,  title   = {Tech in Asia - My Journey in Data Science and Advice for others},  author  = {Yan, Ziyou},  journal = {eugeneyan.com},  year    = {2017},  month   = {Jul},  url     = {https://eugeneyan.com/speaking/my-sharing-with-tech-in-asia-talk/}}
Share on:

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

数据科学 职业发展 机器学习 数据科学家 生涯分享
相关文章