---
title: "7 Habits of Good Data Scientists"
description: "1 - Ain't no homogeneity Asplen-Taylor suggests that the first thing we need to realize is that the data scientist's role is never homogenous ... 5 - The governator factor It’s no surprise to see data g"
type: "news"
locale: "zh-HK"
url: "https://longbridge.com/zh-HK/news/13007573.md"
published_at: "2020-05-01T07:00:21.000Z"
---

# 7 Habits of Good Data Scientists

![Front cover image of The State of Open Data Histories and Horizons.](https://imageproxy.pbkrs.com/https://specials-images.forbesimg.com/imageserve/5eabc874228117000681e837/960x0.jpg/query-Zml0PXNjYWxl?x-oss-process=image/auto-orient,1/interlace,1/resize,w_1440,h_1440/quality,q_95/format,jpg)

There’s one sure thing you can say about data science: it’s a lot of things. Data science is not necessarily one single thing, skillset or methodology. This is why data science is often described as an ‘interdisciplinary branch’ of science that combines mathematics, human behavioral and workflow studies, flexible use of logic systems and a core employment of algorithms.

This makes being a data scientist pretty hard work, as if algorithmic logic weren’t already tough enough. More than just data analytics, more than just big data insight, more than just the ability to handle new streams of raw unstructured data and more than just knowing how to drive a database while blindfolded, data scientists have to understand business and be flexible super-performers.

So what core attributes make a good data scientist?

Simon Asplen-Taylor is interim chief data officer (CDO) and founder at data analytics advisory company Datatick. He previously served at casino and online gaming company Rank Group, where he and his team used WhereScape’s data warehouse automation and big data software for data-science-centric work.

## 1 - Ain't no homogeneity

Asplen-Taylor suggests that the first thing we need to realize is that the data scientist's role is never homogeneous. Different skills are required for different tasks in different roles in different ‘digital workflows’ in different industry verticals in different world markets.

## 2 - Data is a business thing

He advises that organizations that want to embrace data science competently need a data strategy that is aligned to the business goals. Crucially, it needs to be written by a ‘business savvy’ chief data officer (CDO) who can align all the capabilities of data to the business: increasing revenues, reducing costs, reducing risk, and increasing customer and employee satisfaction.

## 3 - Data scientists are experimental

“The work of data scientists is, by definition, experimental. They need to be allowed to experiment and the outcomes may or may not be successful, but do enough experiments in the right areas... and you will find the value,” said Asplen-Taylor.

“Considering problem solving experimentation further, data scientists need to follow, not to lead, i.e.
they need to be given a problem to fix, which means they need business analysts to define the problem… and, after their experimentation phase, they need someone to test the outcome of their projects, validate the results (so they are not marking their own homework) and they need IT people who will put their models into a production environment… and to then document them (which is key from a data privacy perspective, ensuring that what they are doing is transparent) and support the models.”

## 4 - This is cowboy (person) country

Data scientists call the process of bringing different data sets together ‘data wrangling’, in homage to the cattle corralling that cowboys (now, in 2020, cowpersons, obviously) do out on the range.

Asplen-Taylor explains that the reason he and his coworkers saddle up in this way is that if data sets are not engineered properly and ‘productionized’ so that they can be run every day, then they will fail.

“The data sets need to be built, automated and deployed to an environment where the data scientists can access them. The vast majority of companies' data sources that are valuable for generating value are within their existing structured systems, so data scientists should first focus their attention on using this data. As the function matures then they can go after different more elusive data sets … but it’s not the starting point,” he said.

## 5 - The governator factor

It’s no surprise to see data governance and data quality also called out in this list of top-7 traits. This discipline sits as something of an adjunct to the data scientist's role, i.e. an organization with a fully fledged IT department should have a separately defined data quality team, but the data scientists should know who its members are and how competently they will be able to act.

## 6 - Clear and present process

“There needs to be a clear process for Data Science so that people in the business know how the projects work.
A good industry-wide process exists: it's called the CRISP-DM life cycle (Chapman, Clinton, Kerber, et al., 1999),” explained Asplen-Taylor. “It was first set up for data mining, one aspect of data science, but can be applied to all. In this way everyone knows the stages of the lifecycle, and timescales and resources can be applied. Today people think it’s just magic; it isn’t.”

## 7 - Company-wide mentality

As a final factor in this list, Asplen-Taylor says that data scientists need to work with a data architecture that is company-wide. If data scientists define their own architecture and it’s not wholly integrated across the business, then they will duplicate much of what has been done already. That's why the software engineering team (i.e. the programmers and developers) needs to build fast and automate, working closely with the data science team.

“If all of the above does not happen then the data science people will revert to what is easiest, i.e. they will compete with existing Business Intelligence (BI) teams, build their own reports and dashboards and do very little actual science. Companies already know how to do BI and reporting well, and it's not something data scientists should get into.”

## It’s early days, still

As a parting comment, Asplen-Taylor issues a small plea. He says that, for the most part, data scientists are relatively inexperienced (compared to many other long-established IT roles) and have probably not been in the job that long, hence the view that they need to be managed carefully by the CIO, CTO or other C-suite ‘head suit’.

Your organization’s IT department could now be developing this role, so just remember… it’s not rocket science, it’s data science rocketing.
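
As a rough illustration of the life cycle named under habit 6: CRISP-DM is usually described as six phases (business understanding, data understanding, data preparation, modeling, evaluation and deployment). The Python sketch below does not come from Asplen-Taylor or from the CRISP-DM authors. The `Phase` enum and the `next_phase` helper are hypothetical names, and sending a failed evaluation straight back to business understanding is a simplifying assumption.

```python
from enum import Enum


class Phase(Enum):
    """The six phases commonly listed for the CRISP-DM life cycle."""
    BUSINESS_UNDERSTANDING = "Business understanding"
    DATA_UNDERSTANDING = "Data understanding"
    DATA_PREPARATION = "Data preparation"
    MODELING = "Modeling"
    EVALUATION = "Evaluation"
    DEPLOYMENT = "Deployment"


def next_phase(current: Phase, evaluation_ok: bool = True) -> Phase:
    """Return the phase that follows `current` (hypothetical helper).

    Assumption: a failed evaluation restarts the cycle at business
    understanding, and deployment wraps around to a new cycle.
    """
    if current is Phase.EVALUATION and not evaluation_ok:
        return Phase.BUSINESS_UNDERSTANDING
    phases = list(Phase)  # Enum iteration follows definition order
    return phases[(phases.index(current) + 1) % len(phases)]


if __name__ == "__main__":
    # Walk one full pass through the cycle, starting from the business problem.
    phase = Phase.BUSINESS_UNDERSTANDING
    for _ in range(len(Phase)):
        print(phase.value)
        phase = next_phase(phase)
```

Naming the stages explicitly like this is one way a team could attach owners, timescales and resources to each stage, which is the point made in the quote.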

![Simon Asplen-Taylor head and shoulders photo.](https://imageproxy.pbkrs.com/https://specials-images.forbesimg.com/imageserve/5eabc70a9d04a700067fd455/960x0.jpg/query-Zml0PXNjYWxl?x-oss-process=image/auto-orient,1/interlace,1/resize,w_1440,h_1440/quality,q_95/format,jpg)