Since machine learning is inherently data driven, data is at the core data
of machine learning. The goal of machine learning is to design general-
purpose methodologies to extract valuable patterns from data, ideally
without much domain-specific expertise. For example, given a large corpus
of documents (e.g., books in many libraries), machine learning methods
can be used to automatically find relevant topics that are shared across
documents (Hoffman et al., 2010). To achieve this goal, we design mod-
els that are typically related to the process that generates data, similar to model
the dataset we are given. For example, in a regression setting, the model
would describe a function that maps inputs to real-valued outputs. To
paraphrase Mitchell (1997): A model is said to learn from data if its per-
formance on a given task improves after the data is taken into account.
The goal is to find good models that generalize well to yet unseen data,
which we may care about in the future. Learning can be understood as a learning
way to automatically find patterns and structure in data by optimizing the
parameters of the model.
这段说了一大堆,就是在说机器学习就是找到一种通用模型,然后通过不断的优化参数,在一堆数据集合里获取到有价值的数据。(意翻)
本文探讨了机器学习的核心要素——数据与模型。强调了数据对于机器学习的重要性,介绍了如何通过设计通用模型并不断优化参数,从大量数据中提取有价值的信息。重点讲述了机器学习的目标是寻找能良好泛化的模型,以及学习过程是如何通过参数优化自动发现数据中的模式。
&spm=1001.2101.3001.5002&articleId=89363125&d=1&t=3&u=64ff27d58e8d4980b2a018317555d71c)
6128

被折叠的 条评论
为什么被折叠?



