Google publishes best practice documents on machine learning (ML) engineering to help people who have mastered the basics of ML benefit from Google machine learning best practices. It introduces an ML style similar to the Google C++ style guide and other commonly used practical programming guides.
This document is designed to help people who have mastered the basics of ML to benefit from Google Machine Learning's best practices. It introduces an ML style similar to the Google C++ style guide and other commonly used practical programming guides. If you have studied ML courses, or have experience building or developing ML models, you have the necessary background knowledge to read this document.
the term
When we discuss effective ML, the following terms are repeatedly mentioned:
Example: Things to predict. For example, an instance can be a web page that you want to classify as "cat-related" or "not cat-related."
Tags: The answer to the forecasting task. It can be the answer generated by the ML system or the correct answer provided in the training data. For example, the label of a web page may be "related to cats."
Features: Properties of the instance used in the prediction task. For example, a web page might have the feature "contains the word 'cat'".
Feature Column: A group of related features, such as a collection of all countries where the user may live. The feature column of the sample may contain one or more features. "Feature column" is a Google-specific term. Features listed in the VM system used by Yahoo/Microsoft are called "namespaces" or fields.
Sample: An instance (and its characteristics) and a label.
Model: Statistical representation of the forecasting task. You use a sample to train a model and then use that model to make predictions.
Indicator: A value you care about. Perhaps (but not necessarily) can be directly optimized.
Goal: An indicator of which the algorithm is trying to optimize.
Pipeline: The infrastructure of the ML algorithm. Pipelines include collecting data from the front end, putting data into training data files, training one or more models, and applying the model to the production environment.
Clickthrough Rate: The percentage of visitors who clicked on the link in the ad.
Overview
To create a quality product:
Think of yourself as an outstanding engineer, not an ML expert.
In fact, most of the problems you will face are engineering problems. Even with all the resources that a brilliant ML expert has mastered, most of the gains are brought by suitable features (rather than precise machine learning algorithms). So, the basic way to do ML is:
Make sure the pipeline is solid and reliable from beginning to end.
Start by setting a reasonable goal.
Add commonsense features in a simple way.
Make sure the pipeline is always solid and reliable.
The above method will achieve good results in a long time. As long as you can still make progress with some simple techniques, you should not deviate from the above methods. Increasing complexity will slow down the release of future releases.
When you make full use of all the simple techniques, it may be time to explore ML's cutting-edge technologies. See the third phase of the "ML Project" section.
The structure of this document is as follows:
The first part can help you understand if the time to build an ML system is mature.
The second part explains how to deploy the first pipe.
The third section describes how to publish and iterate when adding new features to the pipeline, how to evaluate the model, and how to deal with training-application bias.
The last section explains what to do when you reach the stabilization phase.
This is followed by a list of relevant resources and appendices. The appendix provides some background information on the systems that are mentioned in this document as examples for many times.
Before ML
Rule 1: Do not be afraid to post products that do not use ML technology.
ML technology is cool, but it requires data. In theory, you can use data from other problems and then adjust the model for new products, but the effect is likely to be less than the basic heuristic algorithm. If you think that ML technology can bring you 100% improvement, then the heuristic algorithm can bring you a 50% increase.
For example, if you want to rank apps in the app market, you can use install rates or install times as heuristic algorithm metrics. If you want to detect spam, you can filter out publishers who have previously sent spam. Also, don't be afraid to modify it manually. If you need to rank contacts, they can be sorted (or sorted alphabetically) near and far in the order they are used. If your product does not have to use ML technology, do not use it until you have enough data.
Rule 2: First design and implement indicators.
Before formalizing the function of the ML system, the value of the indicator is tracked in the current system as much as possible. The reason for this is as follows:
Taking action in advance will make it easier to obtain authorization from users of the system.
If you think that you may need to consider an aspect in the future, it's best to start collecting relevant historical data immediately.
If you consider index measurements when designing your system, you will save a lot of effort in the future. Specifically, you don't want to find strings in your logs to measure metrics later!
You will find what has changed and what hasn’t changed. For example, suppose you want to directly optimize the number of single-day active users. However, during the early manipulation of the system, you may find that a significant change in the user experience does not significantly change the indicator.
The Google+ team measures the number of expansions, forwards, +1s, comments, and comment forwarding times per user for each reading, then uses this data to measure the quality of posts when the model is applied. Also note that the experimental framework is very important. You must group users into buckets in the experimental framework and summarize statistics by experiment. See rule 12.
By collecting metrics in a more liberal way, you can understand your system more fully. Found a problem? Add metrics to track it! Are you excited about some of the volume changes that occurred in the previous version? Add metrics to track it!
Rule 3: Select ML technology rather than sophisticated heuristics.
Simple heuristics are good for launching products. However, complicated heuristic algorithms are difficult to maintain. When you have enough data and basically determine what you are trying to achieve, consider using ML technology. As with most software engineering tasks, you need to constantly update methods (whether they are heuristics or ML models) and you will find that ML models are easier to update and maintain (see Rule 16).
ML Stage 1: Your First Pipeline
Focus on the system infrastructure of the first pipeline. Although it is interesting to look forward to all aspects of the innovative ML that you are about to undertake, if you do not first confirm the reliability of the pipeline, it is difficult to figure out what happened.
Rule 4: Ensure that the first model is easy to use and properly implement the infrastructure.
The first model can improve your product quality most effectively, so it doesn't need to be fancy and easy to use. However, you will encounter many unexpected infrastructure issues. Before you publicly launch a new ML system that you have carefully constructed, you must determine the following:
How to get a sample for your learning algorithm.
Initially determine what is the definition of "good" and "bad" for your system.
How to integrate the model into the application. You can apply the model online or you can use the sample offline to precompute the model and store the result in a table. For example, you may need to pre-categorize web pages and store the results in a table, but you may also want to classify chat messages online.
Selecting simple features makes it easier to ensure:
Apply these features correctly to your learning algorithm.
The model learns reasonable weights.
Apply these features to the server side correctly.
When you have a system that can reliably accomplish the above three points, you have completed most of the work. Simple models can provide you with benchmark metrics and benchmark behaviors that you can use to test more complex models. Some teams use "neutral" as the goal of the first release - clearly dilute machine learning outcomes at the time of initial release to avoid distractions.
Rule 5: Separate the machine learning and test the infrastructure separately.
Make sure that the infrastructure is testable and that the learning part of the system is encapsulated to test everything outside these parts. in particular:
Test the effect of the data import algorithm. Check if the feature column that should be filled is filled. With privacy permission, manually check the data entered into the training algorithm. If possible, look at the statistics in the pipeline and compare it with the statistics of the same data that was processed elsewhere.
The test derives the effect of the model from the training algorithm. Make sure that the model in the training environment gives the same score as the model in the application environment (see rule 37).
ML is unpredictable, so have tests for the code that creates the sample in the training environment and the application environment; and make sure that you can load and use the fixed model during the application. In addition, understanding your data is critical: See Practical advice for analyzing large, complex data sets.
Rule 6: Note the discarded data when copying pipes.
Normally, we create new pipes (ie, cargo worship programming) by duplicating existing pipes, and old pipes discard some of the data needed for new pipes. For example, the pipeline for Google+ Top Messages discards older posts (because it constantly tries to rank the latest posts). This pipe is duplicated for Google+ traffic. In messages, older posts still make sense, but old pipes still discard them. Another common mode is to record only the data that the user sees. Therefore, if we want to model the reason why users cannot see a particular post, such data is useless because the pipeline has discarded all of the negative classification samples. There have been similar problems in Play. While processing the Play app homepage, a new pipeline was created, which also contains samples from the Play Games landing page, but no features distinguish the source of each sample.
Rule 7: Turn heuristic algorithms into features or handle them externally.
In general, the problem ML tries to solve is not entirely new. There is an existing system that can be used to rank, sort, or resolve any issues that you are trying to solve. This means that there are many rules and heuristics. After using ML to make adjustments, this type of heuristic algorithm can provide you with convenience. You should mine your own heuristics to understand any information they contain for the following two reasons. First, the transition to the ML system will be smoother. Second, these rules often contain a large amount of intuition about the system that you do not want to discard.
You can use existing heuristics in four ways:
Use heuristics for preprocessing. If the feature is very good, you can choose to do this. For example, in the spam filter, if the sender is blacklisted, do not try to relearn what the "blacklisted" means. Block this email. This method is best suited for use in binary classification tasks.
Create features. Creating features directly through heuristics is a good practice. For example, if you use a heuristic algorithm to calculate the relevance score for a query result, you can include this score as a feature's value. You may want to use ML techniques later to adjust the value (for example, convert the value to one of a limited set of discrete values, or combine it with other features), but first use the original values ​​generated by the heuristic algorithm.
Mining the heuristic algorithm's original input. If an application heuristic algorithm combines the number of installs, the number of characters in the text, and the day of the week, consider splitting the content and providing it as input to the learning algorithm. Some of the techniques that apply to ensemble learning also apply here (see rule 40).
Modify the tag. You can choose to do this when you feel that the heuristic algorithm will get information that is not included in the current tab. For example, if you are trying to maximize the number of downloads, but you also want good content, the possible solution is to multiply the average number of stars obtained by the application with a tag. You have great flexibility in modifying tags. See "Your first goal."
When using heuristics in an ML system, be sure to watch for additional complexity. Using the old heuristic algorithm in the new ML algorithm can help achieve a smooth transition, but think about whether there is an easier way to achieve the same effect.
monitor
In general, implement good alarm security mechanisms, such as designing steps for resolving alerts and providing an "Information Center" page.
Rule 8: Understand the freshness of your system.
If you use the model one day ago, how much will the effect be reduced? What about the model a week ago? A quarter ago model? This type of message helps you understand what needs to be monitored first. If you do not update the model one day, it will have a serious impact on the quality of your product, it is best to let engineers continue to observe the situation. Most ad serving systems handle new ads every day and must be updated daily. For example, if you do not update the ML model of Google Play Search, it will have a negative impact in less than a month. There are no post identifiers in some models of Google+ Top Messages, so there is no need to export these models often. Other models with post identifiers update much more frequently. Also note that the freshness will change over time, especially when adding or removing feature columns from the model.
Rule 9: Detect the problem first and then export the model.
Many ML systems experience the stage of exporting the model to apply the model. If there is a problem with the exported model, it is a user-oriented issue.
Before exporting the model, check for soundness. Specifically, make sure that the model is reasonable in dealing with reserved data. Or, if you always think there is a problem with the data, don't export the model. Many teams deploying models often check the area under the ROC curve (abbreviated AUC) before exporting the model. When there is a problem with a model that has not yet been exported, an e-mail alert needs to be sent; however, when a user-facing model encounters a problem, it may need to be announced through a page. Therefore, it is best to wait until the inspection is complete and ensure that the model is completely safe before exporting the model to avoid impact on the user.
Rule 10: Pay attention to hidden issues.
This problem is more common with ML systems than with other types of systems. Assuming that the associated specific table is no longer updated, then the ML system will adjust accordingly and its behavior will still be quite good, but it will gradually become worse. Sometimes, you will find that some forms have not been updated in a few months, just refresh it, you can get more effective improvement than all the other improvements made in the corresponding quarter! The coverage of a feature may change as a result of a change: for example, a feature column may be filled in 90% of the sample, but the rate suddenly drops to 60%. Google Play had a table of outdated 6 months, but only refreshed the table, the installation rate increased by 2%. If you track the statistics of your data and manually check the data from time to time, you can reduce this failure.
Rule 11: Provide the owner of the feature column and related documents.
If the system is large and there are many feature columns, you need to know the creator or maintainer of each feature column. If you find that someone who understands a particular feature column is leaving, make sure someone knows the relevant information. Although many feature columns have descriptive names, it is a good idea to provide more detailed descriptions of the meaning of the features, the source, and the way in which they are expected to provide assistance.
Your first goal
You will pay attention to a lot of metrics or measurement results about the system, but you can usually only specify one goal for your ML algorithm, which is your algorithm's "attempted" optimization value. Here, I introduce the difference between goals and indicators: indicators are any number reported by your system and may or may not be important. See also rule 2.
Rule 12: When choosing which goal to optimize directly, do not think too much.
You want to make money, want customer satisfaction, want to make the world a better place. There are a lot of metrics you care about, and you should measure all of them (see Rule 2). However, in the early ML process, you will find that these indicators are on the rise, even those that you did not choose to optimize directly. For example, suppose you are concerned about the number of clicks and how long users stay on the site. If you optimize your clicks, it’s likely that your users’ time on the site will increase.
So, when you can still easily add all the metrics, keep it simple and don't worry too much about how to balance different metrics. However, do not overuse this rule: Do not confuse your goals with the system's final health (see rule 39). In addition, if you find yourself increasing your metrics for direct optimization but decide not to publish your system, you may need to modify some of your goals.
Rule 13: Choose a simple metric that is observable and attributable to your first goal.
You often don't know what the real goal is. You think you know it, but when you look at the data and compare the old system to the new ML system, you find that you want to adjust your goals. In addition, different members of the team usually cannot agree on what is the true goal. The ML goal should be a goal that satisfies the following criteria: an agent that is easy to measure and is a "real" goal. In fact, there is usually no "real" goal (see rule 39). Therefore, please train on simple ML goals and consider adding a "strategic layer" at the top so that you can add other logic (preferably very simple logic) for final ranking.
To model, the simplest indicator is user behavior that is directly observable and attributable to system operations:
Did the user click on this ranked link?
Has the user downloaded this ranked object?
Does the user forward/reply/email this ranked audience?
Did the user rate this ranked audience?
Does the user mark this displayed object as spam/sexual/aggressive content?
Avoid modeling the indirect effects at the beginning:
Did the user visit the site the next day?
How long have users stayed on the site?
How many active daily users?
In fact, indirect influences can be excellent indicators that can be used during A/B testing and release decisions.
Finally, don't try to make the ML system clear about the following issues:
Are users satisfied when using the product?
Is the user happy with the experience?
Does the product improve the overall satisfaction of the user?
How does this affect the overall operation of the company?
All of these are important, but they are also extremely difficult to measure. Use proxy metrics instead: If users are satisfied, they will stay longer on the site. If users are satisfied, they will visit the site again tomorrow. In terms of satisfaction and company operating conditions, human judgment is required to correlate any ML goals with the nature and business plan of the products you sell.
Rule 14: Start with an interpretable model to make debugging easier.
Linear regression, logistic regression, and Poisson regression are directly driven by the probability model. Each forecast can be viewed as a probability or expected value. In this way, the model is easier to debug than models that use the target (0-1 loss, various hinge loss functions, etc.) to try to directly optimize the classification accuracy or rank the effect. For example, there is a problem if there is a discrepancy between the probabilities obtained in training and the probabilities predicted by using side-by-side analysis or by examining the production system.
For example, in Linear Regression, Logistic Regression, or Poisson Regression, there is a portion of the average forecasted expected value equal to the average tag value (first-moment calibration, or just calibration). Assuming you have no regularization and the algorithm has converged, then this is theoretically the case, and in fact this is almost the case. If you have a feature, for each sample whose value is either 0 or 1, the 3 sample sets with feature value 1 are calibrated. In addition, if you have a feature that has a value of 1 for each sample, all sample sets will be calibrated.
With a simple model, you can more easily handle feedback loops (see rule 36). In general, we make decisions based on these probabilistic forecasts; for example, rank the posts in descending order based on the expected values ​​(click probability/download probability, etc.). However, please note that when choosing the model to use, your decision is more important than the probability given by the model (see rule 27).
Rule 15: Distinguish junk content filtering and quality ranking in the policy layer.
Quality ranking is an art, but spam filtering is like a war. For users who use your system, the signals you use to determine high quality posts will become apparent, and these users will adjust their posts to have the attributes of high quality posts. Therefore, your quality ranking should focus on ranking honestly published content. You should not apply discounts to spam content because the Quality Ranking Learner ranks spam. Similarly, content that is "not suitable for children" should not be dealt with in the quality ranking. Spam filtering is another matter. You must understand that the characteristics that need to be generated will constantly change. Normally, you will set some obvious rules in the system (if a post has received more than three spam reports, do not retrieve the message, etc.). All learning models must be updated at least daily. Content creators' reputation will play a big role.
At a certain level, the output of these two systems must be integrated together. Please note that when filtering spam in search results, it may be more proactive than filtering spam. The premise of this statement is that you have no regularization and the algorithm has converged. This is generally the case. In addition, it is a standard practice to remove spam from the training data of the mass classifier.
ML Second Stage: Feature Engineering
In the first phase of the ML system life cycle, the important issues involved three aspects: the introduction of training data into the learning system, measurement of any indicators of interest, and the construction of application infrastructure. After you have built an end-to-end stable operation system and have developed system tests and unit tests, you can enter the second stage.
Many of the goals of the second phase are easy to implement, and there are many obvious features that can be introduced into the system. Therefore, the second phase of ML involves introducing as many features as possible and combining them in an intuitive way. At this stage, all the indicators should still be on the rise, you will release the system several times, and it is very suitable for arranging multiple engineers to integrate all the data needed to create truly outstanding learning systems.
Rule 16: Develop a release and iterative model plan.
Don't expect that the model you are building now will be the last model you are about to publish, and don't expect you to stop publishing models. Therefore, please consider whether the increased complexity in this release will slow the release of future releases. Many teams release one or more models every quarter over the years. The three basic reasons for publishing a new model are as follows:
You are about to add new features.
You will adjust the regularization and combine the old features in new ways.
You will adjust the goal.
In any case, there are no disadvantages to consider when building a model: Viewing the data provided to the sample helps to discover new signals, old signals, and damaged signals. Therefore, when building a model, consider how easy it is to add, remove, or recombine features. Consider creating a fresh copy of the pipeline and how easy it is to verify its correctness. Consider whether you can run two or three copies at the same time. Finally, there is no need to worry about whether this version of the pipeline includes the 16th feature (35 in total), which will be included in the next quarter.
Rule 17: Start with characteristics that can be observed and reported directly (rather than learned characteristics).
This point may be controversial, but many problems can be avoided. First of all, let's introduce what is the characteristics of learning. Features learned are features that are generated by external systems (such as unsupervised cluster systems) or the learners themselves (such as through factor models or deep learning). The features generated by these two methods are very useful, but they cause many problems and should not be used in the first model.
If you use external systems to create features, be aware that external systems have their own goals. There may be only a little correlation between the goals of the external system and your current goals. If you acquire a transient state of an external system, it may expire. If you update features from an external system, the meaning of the features may change. If you use external systems to provide features, please note that this method requires great care.
The main problem with the factor model and the depth model is that they are non-convex models. Therefore, there is no guarantee that the optimal solution can be simulated or found, and the local minima found at each iteration may be different. This change makes it difficult to determine whether the impact of a change in the system is meaningful or random. By creating a model with no depth features, you can get excellent benchmark results. After reaching this baseline, you can try a more esoteric approach.
Rule 18: Explore the characteristics of content that can be generalized across contexts.
ML systems are usually just a small part of a larger system. For example, imagine posts that are likely to be used in popular messages. Many users have +1'd, forwarded, or commented on popular posts before they were displayed. If you provide these statistics to the learner, it will promote new posts that have no data in the optimized scenario. YouTube's "Next Watch" can use YouTube Views or Views (Views of another video after watching one video) or explicit user ratings to recommend content. Finally, if you use a user action as a tag, seeing the user perform the action on the document in other situations can be a good feature. With all of these features, you can introduce new content into the situation. Please note that this has nothing to do with personalization: first find out if someone likes the content in this situation and then figure out how much to like.
Rule 19: Use very specific characteristics wherever possible.
For massive data, learning millions of simple features is easier than learning a few complex features. The identifier of the document being retrieved and the normalized query will not provide much generalization, but it will keep your ranking consistent with the tags of the most frequently used queries. Therefore, don't be afraid of feature groups that have the following characteristics: Each feature applies to a small portion of your data but overall coverage is above 90%. You can use regularization to eliminate features that apply too few samples.
Rule 20: Combine and modify existing features to create new features in an easy-to-understand way.
There are many ways to combine and modify features. With an ML system such as TensorFlow, you can preprocess the data through transformations. The two most standard methods are "discretization" and "combination."
"Discretization" refers to extracting a continuous feature and creating many discrete features from it. Take the continuous feature of age as an example. You can create a feature with a value of 1 when you are under 18 and create another feature with a value of 1 between ages 18-35, and so on. Don't worry too much about the boundaries of these histograms: Basic quantiles have the greatest impact on you.
The "combination" method refers to the combination of two or more feature columns. In TensorFlow, characteristic columns refer to homogeneous feature sets (for example, {men, women}, {United States, Canada, Mexico}, etc.). A combination refers to a new feature column that contains features such as {male, female} × {United States, Canada, Mexico}. This new feature column will contain features (male, Canada). If you are using TensorFlow and have TensorFlow create this combination for you, this (male, Canadian) feature will exist in a sample that represents a Canadian male. Note that you need to have a lot of data to use a combined learning model with three, four, or more datum feature columns.
Combinations that generate very large feature columns may overfit. For example, suppose you are performing some sort of search. One of your characteristic columns contains the words in the query and the other one contains the words in the document. At this point, you can combine these feature columns using the "combination" method, but you end up with many features (see rule 21).
There are two alternative methods for processing text. The most demanding method is dot product. When the dot product method uses the simplest form, only the number of common words between the query and the document is calculated. Then this feature is discretized. Another method is intersection: if the intersection method is used, a feature will appear if and only if both the document and the query contain the word "pony"; if and only if both the document and the query contain the word "the" Only another feature will appear.
Rule 21: The number of feature weights you can learn in a linear model is roughly proportional to the amount of data you have.
There are various excellent results of statistical learning theory on the appropriate complexity of the model, but you basically only need to understand this rule. In a conversation, people have expressed such doubts: Can we learn anything from a thousand samples, or if we need more than one million samples, they have such doubts because they are limited to A specific learning style. The key is to adjust your learning model based on the data size:
If you are building a search ranking system with millions of different words in documents and queries, and you have 1000 labeled samples, you should manually engineer in document and query features, TF-IDF, and multiple other heights There is a dot product between the characteristics of the chemical. You have 1000 samples and more than a dozen features.
If you have one million samples, use regularization and feature selection (possibly) to intersect the document feature column with the query feature column. As a result, you will get millions of features; but if you use regularization, you get fewer features. You will have tens of millions of samples and may produce 100,000 features.
If you have billions or hundreds of billions of samples, you can use feature selection and regularization to combine feature columns with document and query tags. You will have one billion samples and ten million features. Statistical learning theory rarely sets strict limits, but it can provide a good starting point for guidance.
Finally, according to Rule 28 decide which features to use.
Rule 22: Clean up features that are no longer used.
Unused features generate technical liabilities. If you find yourself not using a feature, and combining it with other features does not work, then remove it from your infrastructure. You need to keep your infrastructure simple so that you try the features that are most likely to bring good results as quickly as possible. If necessary, Others can add your characteristics back at any time.
When determining which features to add or retain, consider coverage. That is, how many samples are covered by the corresponding features? For example, if you have some personalization features, but only 8% of users have personalization features, the effect will not be very good.
At the same time, some features may exceed their weight. For example, if one of your characteristics covers only 1% of the data, but 90% of the samples with that characteristic are positive classifications, this is a good feature that can be added.
Manual analysis of the system
Before continuing to explore the third phase of ML, it is important to focus on what you cannot learn in any ML course: how to examine existing models and make improvements. This is more like an art than science, but there are several anti-patterns that need to be avoided.
Rule 23: You are not a typical end user.
This may be the easiest way to get the team in trouble. Although there are many advantages to fishfood (using prototypes within the team) and dogfood (using prototypes within the company), employees should see if they meet the performance requirements. Although you should avoid applying significantly worse changes, when it comes to production, any changes that seem reasonable should be further tested, in two ways: ask non-professionals to answer paid questions on a crowdsourcing platform, or Users conduct online experiments.
There are two reasons for this. First of all, your relationship with the code is too close. You may be concerned with a particular aspect of the post, or you may just be putting too much emotion into it (such as confirming deviations). Second, your time is precious. Consider how many of the nine engineer's expenses spent on an hour-long meeting can buy on the crowdsourcing platform.
If you really want to get user feedback, use the user experience method.在æµç¨‹çš„早期阶段创建用户角色(请å‚阅比尔- 布克斯顿的Sketching User Experiences 一书ä¸çš„æ述),然åŽè¿›è¡Œå¯ç”¨æ€§æµ‹è¯•ï¼ˆè¯·å‚阅å²è’‚夫- å…‹é²æ ¼çš„Don't Make Me Think 一书ä¸çš„æ述)。用户角色是指创建å‡æƒ³ç”¨æˆ·ã€‚例如,如果您的团队æˆå‘˜éƒ½æ˜¯ç”·æ€§ï¼Œåˆ™æœ‰å¿…è¦è®¾è®¡ä¸€ä¸ª35 å²çš„女性用户角色(使用用户特å¾å®Œæˆï¼‰ï¼Œå¹¶æŸ¥çœ‹å…¶ç”Ÿæˆçš„结果,而ä¸æ˜¯åªæŸ¥çœ‹10 ä½25-40 å²ç”·æ€§çš„结果。在å¯ç”¨æ€§æµ‹è¯•ä¸è¯·çœŸå®žç”¨æˆ·ä½“验您的网站(通过本地或远程方å¼ï¼‰å¹¶è§‚察他们的å应也å¯ä»¥è®©æ‚¨ä»¥å…¨æ–°çš„视角看待问题。
第24 æ¡è§„则:衡é‡æ¨¡åž‹é—´çš„差异。
在å‘任何用户展示您的新模型之å‰ï¼Œæ‚¨å¯ä»¥è¿›è¡Œçš„最简å•ï¼ˆæœ‰æ—¶ä¹Ÿæ˜¯æœ€æœ‰ç”¨ï¼‰çš„一项衡é‡æ˜¯ï¼Œè¯„估新模型的结果与生产有多大差别。例如,如果您有一项排å任务,则在整个系统ä¸é’ˆå¯¹ä¸€æ‰¹ç¤ºä¾‹æŸ¥è¯¢è¿è¡Œè¿™ä¸¤ä¸ªæ¨¡åž‹ï¼Œå¹¶æŸ¥çœ‹ç»“果的对称差分有多大(按排åä½ç½®åŠ æƒï¼‰ã€‚如果差分éžå¸¸å°ï¼Œé‚£ä¹ˆæ‚¨æ— 需è¿è¡Œå®žéªŒï¼Œå°±å¯ä»¥åˆ¤æ–ä¸ä¼šå‡ºçŽ°å¾ˆå¤§å˜åŒ–。如果差分很大,那么您需è¦ç¡®ä¿è¿™ç§æ›´æ”¹å¯ä»¥å¸¦æ¥å¥½çš„结果。查看对称差分较大的查询有助于您了解更改的性质。ä¸è¿‡ï¼Œè¯·ç¡®ä¿æ‚¨çš„系统是稳定的。确ä¿æ¨¡åž‹ä¸Žè‡ªèº«ä¹‹é—´çš„对称差分较低(ç†æƒ³æƒ…况下为零)。
第25 æ¡è§„则:选择模型时,实用效果比预测能力更é‡è¦ã€‚
您的模型å¯èƒ½ä¼šå°è¯•é¢„æµ‹ç‚¹å‡»çŽ‡ã€‚ä½†å½’æ ¹åˆ°åº•ï¼Œå…³é”®é—®é¢˜åœ¨äºŽæ‚¨ç”¨è¿™ç§é¢„测åšä»€ä¹ˆã€‚如果您使用该预测对文档进行排å,那么最终排åçš„è´¨é‡æ¯”预测本身更é‡è¦ã€‚如果您è¦é¢„测一个文档是垃圾内容的概率,然åŽé€‰æ‹©ä¸€ä¸ªå–èˆç‚¹æ¥ç¡®å®šè¦é˜»æ–的内容,那么å…许的内容的精确率更为é‡è¦ã€‚大多数情况下,这两项应该是一致的:当它们ä¸ä¸€è‡´æ—¶ï¼Œå¸¦æ¥çš„优势å¯èƒ½ä¼šéžå¸¸å°ã€‚å› æ¤ï¼Œå¦‚æžœæŸç§æ›´æ”¹å¯ä»¥æ”¹å–„对数æŸå¤±ï¼Œä½†ä¼šé™ä½Žç³»ç»Ÿçš„性能,则查找其他特å¾ã€‚当这ç§æƒ…况开始频ç¹å‘生时,说明您该é‡æ–°å®¡è§†æ¨¡åž‹çš„ç›®æ ‡äº†ã€‚
第26 æ¡è§„则:在衡é‡çš„错误ä¸å¯»æ‰¾è§„律,并创建新特å¾ã€‚
å‡è®¾æ‚¨çœ‹åˆ°æ¨¡åž‹"弄错" 了一个è®ç»ƒæ ·æœ¬ã€‚在分类任务ä¸ï¼Œè¿™ç§é”™è¯¯å¯èƒ½æ˜¯å‡æ£ä¾‹ï¼Œä¹Ÿå¯èƒ½æ˜¯å‡è´Ÿä¾‹ã€‚在排å任务ä¸ï¼Œè¿™ç§é”™è¯¯å¯èƒ½æ˜¯å‡æ£ä¾‹å’Œå‡è´Ÿä¾‹ï¼Œå…¶ä¸æ£ä¾‹çš„排å比负例的排å低。最é‡è¦çš„是,ML系统知é“è‡ªå·±å¼„é”™äº†è¯¥æ ·æœ¬ï¼Œå¦‚æžœæœ‰æœºä¼šï¼Œå®ƒä¼šä¿®å¤è¯¥é”™è¯¯ã€‚如果您å‘该模型æ供一个å…许其修æ£é”™è¯¯çš„特å¾ï¼Œè¯¥æ¨¡åž‹ä¼šå°è¯•ä½¿ç”¨å®ƒã€‚
å¦ä¸€æ–¹é¢ï¼Œå¦‚果您å°è¯•æ ¹æ®ç³»ç»Ÿä¸ä¼šè§†ä¸ºé”™è¯¯çš„æ ·æœ¬åˆ›å»ºä¸€ä¸ªç‰¹å¾ï¼Œè¯¥ç‰¹å¾å°†ä¼šè¢«ç³»ç»Ÿå¿½ç•¥ã€‚例如,å‡è®¾ç”¨æˆ·åœ¨Play 应用æœç´¢ä¸æœç´¢"å…费游æˆ"。å‡è®¾æŽ’åé å‰çš„æœç´¢ç»“æžœä¸æœ‰ä¸€ä¸ªæ˜¯ç›¸å…³æ€§è¾ƒä½Žçš„æžç¬‘åº”ç”¨ã€‚å› æ¤ï¼Œæ‚¨ä¸º"æžç¬‘应用" 创建了一个特å¾ã€‚但是,如果您è¦æœ€å¤§é™åº¦åœ°å¢žåŠ 安装次数,并且用户在æœç´¢å…费游æˆæ—¶å®‰è£…了æžç¬‘应用,那么"æžç¬‘应用" 特å¾ä¸ä¼šè¾¾åˆ°æ‚¨æƒ³è¦çš„效果。
如果模型弄错了您的æŸäº›æ ·æœ¬ï¼Œè¯·åœ¨å½“å‰ç‰¹å¾é›†ä¹‹å¤–寻找规律。例如,如果系统似乎在é™ä½Žå†…容较长的帖å的排åï¼Œé‚£ä¹ˆæ·»åŠ å¸–å长度。ä¸è¦æ·»åŠ 过于具体的特å¾ã€‚如果您è¦æ·»åŠ 帖å长度,请ä¸è¦è¯•å›¾çŒœæµ‹é•¿åº¦çš„具体å«ä¹‰ï¼Œåªéœ€æ·»åŠ å多个特å¾ï¼Œç„¶åŽè®©æ¨¡åž‹è‡ªè¡Œå¤„ç†ï¼ˆè¯·å‚阅第二å一æ¡è§„åˆ™ï¼‰ã€‚è¿™æ˜¯å®žçŽ°ç›®æ ‡æœ€ç®€å•çš„æ–¹å¼ã€‚
第27 æ¡è§„则:å°è¯•é‡åŒ–观察到的异常行为。
当现有的æŸå¤±å‡½æ•°æ²¡æœ‰æ•èŽ·æ‚¨å›¢é˜Ÿä¸çš„部分æˆå‘˜ä¸å–œæ¬¢çš„æŸäº›ç³»ç»Ÿå±žæ€§æ—¶ï¼Œä»–们会开始有挫败感。æ¤æ—¶ï¼Œä»–们应该ç«å°½æ‰€èƒ½å°†æŠ±æ€¨è½¬æ¢æˆå…·ä½“çš„æ•°å—。例如,如果他们认为Play æœç´¢ä¸æ˜¾ç¤ºçš„"æžç¬‘应用" 过多,则å¯ä»¥é€šè¿‡äººå·¥è¯„分识别æžç¬‘应用。(在这ç§æƒ…况下,您å¯ä»¥ä½¿ç”¨äººå·¥æ ‡è®°çš„æ•°æ®ï¼Œå› 为相对较少的一部分查询å 了很大一部分æµé‡ã€‚)如果您的问题是å¯è¡¡é‡çš„,那么您å¯ä»¥å¼€å§‹å°†å®ƒä»¬ç”¨ä½œç‰¹å¾ã€ç›®æ ‡æˆ–æŒ‡æ ‡ã€‚ä¸€èˆ¬è§„åˆ™æ˜¯" å…ˆé‡åŒ–,å†ä¼˜åŒ– "。
第28 æ¡è§„则:请注æ„,çŸæœŸè¡Œä¸ºç›¸åŒå¹¶ä¸æ„味ç€é•¿æœŸè¡Œä¸ºä¹Ÿç›¸åŒã€‚
å‡è®¾æ‚¨çš„新系统会查看æ¯ä¸ªdoc_id å’Œexact_query,然åŽè®¡ç®—æ¯ä¸ªæŸ¥è¯¢çš„æ¯ä¸ªæ–‡æ¡£çš„点击概率。您å‘现在并排分æžå’ŒA/B 测试ä¸ï¼Œå…¶è¡Œä¸ºä¸Žæ‚¨å½“å‰ç³»ç»Ÿçš„è¡Œä¸ºå‡ ä¹Žå®Œå…¨ç›¸åŒï¼Œè€ƒè™‘到它的简å•æ€§ï¼Œæ‚¨å‘布了它。ä¸è¿‡ï¼Œæ‚¨å‘现它没有显示任何新应用。 why?é‚£æ˜¯å› ä¸ºæ‚¨çš„ç³»ç»Ÿä»…æ ¹æ®è‡ªå·±çš„查询历å²è®°å½•æ˜¾ç¤ºæ–‡æ¡£ï¼Œæ‰€ä»¥ä¸çŸ¥é“应该显示新文档。
了解这ç§ç³»ç»Ÿé•¿æœŸè¡Œä¸ºçš„唯一方法是,仅使用模型在线时获得的数æ®å¯¹å…¶è¿›è¡Œè®ç»ƒã€‚这一点éžå¸¸éš¾ã€‚
è®ç»ƒ- 应用åå·®
è®ç»ƒ- 应用å差是指è®ç»ƒæ•ˆæžœä¸Žåº”用效果之间的差异。出现这ç§åå·®çš„åŽŸå› å¯èƒ½æ˜¯ï¼š
è®ç»ƒç®¡é“和应用管é“ä¸æ•°æ®çš„处ç†æ–¹å¼æœ‰å·®å¼‚。
è®ç»ƒæ—¶å’Œåº”用时所用数æ®æœ‰å˜åŒ–。
模型和算法之间有å馈环。
我们注æ„到Google 的生产ML系统也å˜åœ¨è®ç»ƒ- 应用å差,这ç§å差对性能产生了负é¢å½±å“。最好的解决方案是明确进行监控,以é¿å…在系统和数æ®æ”¹å˜æ—¶å¼•å…¥å®¹æ˜“被忽视的å差。
第29 æ¡è§„则:确ä¿è®ç»ƒæ•ˆæžœå’Œåº”ç”¨æ•ˆæžœä¸€æ ·çš„æœ€ä½³æ–¹æ³•æ˜¯ï¼Œä¿å˜åœ¨åº”用时使用的特å¾é›†ï¼Œç„¶åŽå°†è¿™äº›ç‰¹å¾é€šè¿‡ç®¡é“ä¼ è¾“åˆ°æ—¥å¿—ï¼Œä»¥ä¾¿åœ¨è®ç»ƒæ—¶ä½¿ç”¨ã€‚
å³ä½¿æ‚¨ä¸èƒ½å¯¹æ¯ä¸ªæ ·æœ¬éƒ½è¿™æ ·åšï¼Œä¹Ÿå¯¹ä¸€å°éƒ¨åˆ†æ ·æœ¬è¿™æ ·åšï¼Œä»¥ä¾¿éªŒè¯åº”用和è®ç»ƒä¹‹é—´çš„一致性(请å‚阅第37 æ¡è§„则)。采å–了这项措施的Google 团队有时会对结果感到惊讶。 YouTube 首页改用这ç§åœ¨åº”用时记录特å¾çš„åšæ³•åŽï¼Œä¸ä»…大大æ高了质é‡ï¼Œè€Œä¸”å‡å°‘了代ç å¤æ‚度。目å‰æœ‰è®¸å¤šå›¢é˜Ÿéƒ½å·²ç»åœ¨å…¶åŸºç¡€è®¾æ–½ä¸Šé‡‡ç”¨äº†è¿™ç§æ–¹æ³•ã€‚
第30 æ¡è§„则:按é‡è¦æ€§å¯¹é‡‡æ ·æ•°æ®åŠ æƒï¼Œä¸è¦éšæ„丢弃它们ï¼
æ•°æ®è¿‡å¤šæ—¶ï¼Œæ€»ä¼šå¿ä¸ä½é‡‡ç”¨å‰é¢çš„文件而忽略åŽé¢çš„文件。这是错误的åšæ³•ã€‚尽管å¯ä»¥ä¸¢å¼ƒä»Žæœªå‘用户展示过的数æ®ï¼Œä½†å¯¹äºŽå…¶ä»–æ•°æ®æ¥è¯´ï¼ŒæŒ‰é‡è¦æ€§åŠ æƒæ˜¯æœ€ä½³é€‰æ‹©ã€‚按é‡è¦æ€§åŠ æƒæ„味ç€ï¼Œå¦‚果您决定以30% çš„æ¦‚çŽ‡å¯¹æ ·æœ¬X è¿›è¡ŒæŠ½æ ·ï¼Œé‚£ä¹ˆå‘其赋予10/3 çš„æƒé‡ã€‚按é‡è¦æ€§åŠ æƒæ—¶ï¼Œæ‚¨ä»ç„¶å¯ä»¥ä½¿ç”¨ç¬¬14 æ¡è§„则ä¸è®¨è®ºçš„æ‰€æœ‰æ ¡å‡†å±žæ€§ã€‚
第31 æ¡è§„则:如果您在è®ç»ƒå’Œåº”用期间关è”è¡¨æ ¼ä¸çš„æ•°æ®ï¼Œè¯·æ³¨æ„ï¼Œè¡¨æ ¼ä¸çš„æ•°æ®å¯èƒ½ä¼šå˜åŒ–。
å‡è®¾æ‚¨å°†æ–‡æ¡£ID 与包å«è¿™äº›æ–‡æ¡£çš„特å¾ï¼ˆä¾‹å¦‚è¯„è®ºæ¬¡æ•°æˆ–ç‚¹å‡»æ¬¡æ•°ï¼‰çš„è¡¨æ ¼ç›¸å…³è”ã€‚è¡¨æ ¼ä¸çš„特å¾åœ¨è®ç»ƒæ—¶å’Œåº”用时å¯èƒ½æœ‰æ‰€ä¸åŒã€‚那么,您的模型在è®ç»ƒæ—¶å’Œåº”用时对åŒä¸€æ–‡æ¡£çš„预测就å¯èƒ½ä¼šä¸åŒã€‚è¦é¿å…这类问题,最简å•çš„方法是在应用时记录特å¾ï¼ˆè¯·å‚阅第32 æ¡è§„åˆ™ï¼‰ã€‚å¦‚æžœè¡¨æ ¼åªæ˜¯ç¼“æ…¢å‘生å˜åŒ–,那么您还å¯ä»¥æ¯å°æ—¶æˆ–æ¯å¤©åˆ›å»ºè¡¨æ ¼å¿«ç…§ï¼Œä»¥èŽ·å¾—éžå¸¸æŽ¥è¿‘çš„æ•°æ®ã€‚请注æ„,这ä»ä¸èƒ½å®Œå…¨è§£å†³é—®é¢˜ã€‚
第32 æ¡è§„则:尽å¯èƒ½åœ¨è®ç»ƒç®¡é“和应用管é“é—´é‡å¤ä½¿ç”¨ä»£ç 。
批处ç†ä¸åŒäºŽåœ¨çº¿å¤„ç†ã€‚进行在线处ç†æ—¶ï¼Œæ‚¨å¿…须在æ¯ä¸ªè¯·æ±‚到达时对其进行处ç†ï¼ˆä¾‹å¦‚,您必须为æ¯ä¸ªæŸ¥è¯¢å•ç‹¬è¿›è¡ŒæŸ¥æ‰¾ï¼‰ï¼Œè€Œè¿›è¡Œæ‰¹å¤„ç†æ—¶ï¼Œæ‚¨å¯ä»¥ç»„åˆä»»åŠ¡ï¼ˆä¾‹å¦‚进行关è”)。应用时,您进行的是在线处ç†ï¼Œè€Œè®ç»ƒæ—¶ï¼Œæ‚¨è¿›è¡Œçš„是批处ç†ã€‚ä¸è¿‡ï¼Œæ‚¨å¯ä»¥é€šè¿‡ä¸€äº›æ–¹æ³•æ¥é‡å¤ä½¿ç”¨ä»£ç 。例如,您å¯ä»¥ä¸“门为自己的系统创建一个对象,其ä¸æ‰€æœ‰æŸ¥è¯¢ç»“果和关è”都能以éžå¸¸æ˜“于人类读å–çš„æ–¹å¼è¿›è¡Œå˜å‚¨ï¼Œä¸”错误也å¯ä»¥è½»æ¾è¿›è¡Œæµ‹è¯•ã€‚然åŽï¼Œæ”¶é›†äº†æ‰€æœ‰ä¿¡æ¯åŽï¼Œæ‚¨å¯ä»¥åœ¨åº”用和è®ç»ƒæœŸé—´ä½¿ç”¨ä¸€ç§å…±åŒçš„方法,在人类å¯è¯»å¯¹è±¡ï¼ˆç‰¹å®šäºŽæ‚¨çš„系统)和ML需è¦çš„ä»»ä½•æ ¼å¼ä¹‹é—´æž¶èµ·ä¸€åº§æ¡¥æ¢ã€‚è¿™æ ·å¯ä»¥æ¶ˆé™¤è®ç»ƒ- 应用åå·®çš„ä¸€ä¸ªæ ¹æºã€‚ç”±æ¤æŽ¨çŸ¥ï¼Œåœ¨è®ç»ƒå’Œåº”用时,尽é‡ä¸è¦ä½¿ç”¨ä¸¤ç§ä¸åŒçš„编程è¯è¨€ã€‚å¦‚æžœè¿™æ ·åšï¼Œå°±å‡ 乎ä¸å¯èƒ½å…±äº«ä»£ç 了。
第33 æ¡è§„åˆ™ï¼šå¦‚æžœæ‚¨æ ¹æ®1 月5 日之å‰çš„æ•°æ®ç”Ÿæˆæ¨¡åž‹ï¼Œåˆ™æ ¹æ®1 月6 æ—¥åŠä¹‹åŽçš„æ•°æ®æµ‹è¯•æ¨¡åž‹ã€‚
一般æ¥è¯´ï¼Œè¦è¡¡é‡æ¨¡åž‹çš„效果,应使用在è®ç»ƒæ¨¡åž‹æ‰€æœ‰æ•°æ®å¯¹åº”的日期之åŽçš„日期收集的数æ®ï¼Œå› ä¸ºè¿™æ ·èƒ½æ›´å¥½åœ°åæ˜ ç³»ç»Ÿåº”ç”¨åˆ°ç”Ÿäº§æ—¶çš„è¡Œä¸ºã€‚å¦‚æžœæ‚¨æ ¹æ®1 月5 日之å‰çš„æ•°æ®ç”Ÿæˆæ¨¡åž‹ï¼Œåˆ™æ ¹æ®1 月6 æ—¥åŠä¹‹åŽçš„æ•°æ®æµ‹è¯•æ¨¡åž‹ã€‚您一般会å‘现,使用新数æ®æ—¶æ¨¡åž‹çš„效果ä¸å¦‚原æ¥å¥½ï¼Œä½†åº”该ä¸ä¼šå¤ªç³Ÿã€‚由于å¯èƒ½å˜åœ¨çš„一些日常影å“,您å¯èƒ½æ²¡æœ‰é¢„测到平å‡ç‚¹å‡»çŽ‡æˆ–转化率,但曲线下é¢ç§¯ï¼ˆè¡¨ç¤ºæ£åˆ†ç±»æ ·æœ¬çš„åˆ†æ•°é«˜äºŽè´Ÿåˆ†ç±»æ ·æœ¬çš„æ¦‚çŽ‡ï¼‰åº”è¯¥éžå¸¸æŽ¥è¿‘。
第34 æ¡è§„则:在有关过滤的二元分类(例如,垃圾邮件检测或确定有趣的电å邮件)ä¸ï¼Œåœ¨çŸæœŸå†…å°å°ç‰ºç‰²ä¸€ä¸‹æ•ˆæžœï¼Œä»¥èŽ·å¾—éžå¸¸çº¯å‡€çš„æ•°æ®ã€‚
在过滤任务ä¸ï¼Œæ ‡è®°ä¸ºè´Ÿåˆ†ç±»çš„æ ·æœ¬ä¸ä¼šå‘用户显示。å‡è®¾æ‚¨çš„过滤器在应用时å¯å±è”½75% çš„è´Ÿåˆ†ç±»æ ·æœ¬ã€‚æ‚¨å¯èƒ½ä¼šå¸Œæœ›ä»Žå‘用户显示的实例ä¸æå–é¢å¤–çš„è®ç»ƒæ•°æ®ã€‚例如,如果用户将您的过滤器未å±è”½çš„电åé‚®ä»¶æ ‡è®°ä¸ºåžƒåœ¾é‚®ä»¶ï¼Œé‚£ä¹ˆæ‚¨å¯èƒ½æƒ³è¦ä»Žä¸å¦ä¹ 规律。
但这ç§æ–¹æ³•ä¼šå¼•å…¥é‡‡æ ·å差。如果您改为在应用期间将所有æµé‡çš„1% æ ‡è®°ä¸º"预留",并å‘用户å‘é€æ‰€æœ‰é¢„ç•™æ ·æœ¬ï¼Œåˆ™æ‚¨å¯ä»¥æ”¶é›†æ›´çº¯å‡€çš„æ•°æ®ã€‚现在,过滤器å±è”½äº†è‡³å°‘74% çš„è´Ÿåˆ†ç±»æ ·æœ¬ã€‚è¿™äº›é¢„ç•™æ ·æœ¬å¯ä»¥æˆä¸ºè®ç»ƒæ•°æ®ã€‚
请注æ„,如果过滤器å±è”½äº†95% æˆ–ä»¥ä¸Šçš„è´Ÿåˆ†ç±»æ ·æœ¬ï¼Œåˆ™æ¤æ–¹æ³•çš„å¯è¡Œæ€§ä¼šé™ä½Žã€‚å³ä¾¿å¦‚æ¤ï¼Œå¦‚果您希望衡é‡åº”用效果,å¯ä»¥è¿›è¡Œæ›´ä½Žæ¯”ä¾‹çš„é‡‡æ ·ï¼ˆæ¯”å¦‚0.1% 或0.001%ï¼‰ã€‚ä¸€ä¸‡ä¸ªæ ·æœ¬è¶³ä»¥éžå¸¸å‡†ç¡®åœ°è¯„估效果。
第35 æ¡è§„则:注æ„排å问题ä¸å˜åœ¨çš„固有å差。
当您彻底改å˜æŽ’å算法,导致出现ä¸åŒçš„排å结果时,实际上改å˜äº†æ‚¨çš„算法以åŽä¼šå¤„ç†çš„æ•°æ®ã€‚这时,就会出现固有å差,您应该围绕这ç§åå·®æ¥è®¾è®¡æ¨¡åž‹ã€‚具体方法有多ç§ã€‚以下是让您的模型é’çå·²è§è¿‡çš„æ•°æ®çš„方法。
对覆盖更多查询的特å¾ï¼ˆè€Œä¸æ˜¯ä»…覆盖一个查询的特å¾ï¼‰è¿›è¡Œæ›´é«˜çš„æ£åˆ™åŒ–。通过这ç§æ–¹å¼ï¼Œæ¨¡åž‹å°†é’çä¸“é—¨é’ˆå¯¹ä¸€ä¸ªæˆ–å‡ ä¸ªæŸ¥è¯¢çš„ç‰¹å¾ï¼Œè€Œä¸æ˜¯æ³›åŒ–到所有查询的特å¾ã€‚è¿™ç§æ–¹æ³•æœ‰åŠ©äºŽé˜²æ¢å分çƒé—¨çš„查询结果显示到ä¸ç›¸å…³çš„查询ä¸ã€‚请注æ„ï¼Œè¿™ä¸Žä»¥ä¸‹æ›´ä¸ºä¼ ç»Ÿçš„å»ºè®®ç›¸å·¦ï¼šå¯¹å…·æœ‰æ›´å¤šå”¯ä¸€å€¼çš„ç‰¹å¾åˆ—进行更高的æ£åˆ™åŒ–。
ä»…å…许特å¾å…·æœ‰æ£æƒé‡ã€‚è¿™æ ·ä¸€æ¥ï¼Œå°±å¯ç¡®ä¿ä»»ä½•å¥½ç‰¹å¾éƒ½æ¯”"未知" 特å¾åˆé€‚。
ä¸é€‰æ‹©åªå¤„ç†æ–‡æ¡£æ•°æ®çš„特å¾ã€‚这是第一æ¡è§„则的æžç«¯ç‰ˆæœ¬ã€‚例如,å³ä½¿æŒ‡å®šåº”用是çƒé—¨ä¸‹è½½åº”ç”¨ï¼ˆæ— è®ºæŸ¥è¯¢æ˜¯ä»€ä¹ˆï¼‰ï¼Œæ‚¨ä¹Ÿä¸æƒ³åœ¨æ‰€æœ‰åœ°æ–¹éƒ½å±•ç¤ºå®ƒã€‚如果ä¸é€‰æ‹©åªå¤„ç†æ–‡æ¡£æ•°æ®çš„特å¾ï¼Œè¿™ä¸€ç‚¹å¾ˆå®¹æ˜“åšåˆ°ã€‚您之所以ä¸æƒ³åœ¨æ‰€æœ‰åœ°æ–¹å±•ç¤ºæŸä¸ªç‰¹å®šçš„çƒé—¨åº”ç”¨ï¼Œæ˜¯å› ä¸ºè®©ç”¨æˆ·å¯ä»¥æ‰¾åˆ°æ‰€æœ‰æ‰€éœ€åº”用至关é‡è¦ã€‚例如,如果一ä½ç”¨æˆ·æœç´¢"èµé¸Ÿåº”用",他/她å¯èƒ½ä¼šä¸‹è½½"愤怒的å°é¸Ÿ",但那ç»å¯¹ä¸æ˜¯ä»–/她想è¦çš„应用。展示æ¤ç±»åº”用å¯èƒ½ä¼šæ高下载率,但最终å´æœªèƒ½æ»¡è¶³ç”¨æˆ·çš„需求。
第36 æ¡è§„则:通过ä½ç½®ç‰¹å¾é¿å…出现å馈环。
内容的ä½ç½®ä¼šæžå¤§åœ°å½±å“用户与其互动的å¯èƒ½æ€§ã€‚如果您将应用放在首ä½ï¼Œåˆ™åº”用获得的点击率更高,导致您认为用户更有å¯èƒ½ç‚¹å‡»è¯¥åº”用。处ç†æ¤ç±»é—®é¢˜çš„一ç§æ–¹æ³•æ˜¯æ·»åŠ ä½ç½®ç‰¹å¾ï¼Œå³å…³äºŽå†…容在网页ä¸çš„ä½ç½®çš„特å¾ã€‚您å¯ä»¥ä½¿ç”¨ä½ç½®ç‰¹å¾è®ç»ƒæ¨¡åž‹ï¼Œä½¿æ¨¡åž‹å¦ä¹ (例如)对特å¾"1stposition" 赋予较高的æƒé‡ã€‚å› æ¤ï¼Œå¯¹äºŽå…·æœ‰"1stposition=true" 特å¾çš„æ ·æœ¬çš„å…¶ä»–å› ç´ ï¼Œæ¨¡åž‹ä¼šèµ‹äºˆè¾ƒä½Žçš„æƒé‡ã€‚然åŽï¼Œåœ¨åº”用时,您ä¸å‘任何实例æä¾›ä½ç½®ç‰¹å¾ï¼Œæˆ–为所有实例æ供相åŒçš„默认特å¾ï¼Œå› ä¸ºåœ¨å†³å®šä»¥æ€Žæ ·çš„é¡ºåºæ˜¾ç¤ºå€™é€‰å®žä¾‹ä¹‹å‰ï¼Œæ‚¨å°±å¯¹å…¶è¿›è¡Œäº†æ‰“分。
请注æ„ï¼Œå› ä¸ºè®ç»ƒå’Œæµ‹è¯•ä¹‹é—´çš„è¿™ç§ä¸å¯¹ç§°æ€§ï¼Œè¯·åŠ¡å¿…在ä½ç½®ç‰¹å¾ä¸Žæ¨¡åž‹çš„其余特å¾ä¹‹é—´ä¿æŒä¸€å®šçš„分离性。让模型æˆä¸ºä½ç½®ç‰¹å¾å‡½æ•°å’Œå…¶ä½™ç‰¹å¾å‡½æ•°ä¹‹å’Œæ˜¯ç†æƒ³çš„状æ€ã€‚例如,ä¸è¦å°†ä½ç½®ç‰¹å¾ä¸Žä»»ä½•æ–‡æ¡£ç‰¹å¾ç»„åˆåœ¨ä¸€èµ·ã€‚
第37 æ¡è§„则:测é‡è®ç»ƒ/ 应用å差。
一般æ¥è¯´ï¼Œå¾ˆå¤šæƒ…况都会引起å差。æ¤å¤–,您å¯ä»¥å°†å…¶åˆ†ä¸ºä»¥ä¸‹å‡ 个部分:
è®ç»ƒæ•°æ®å’Œé¢„留数æ®çš„效果之间的差异。一般æ¥è¯´ï¼Œè¿™ç§æƒ…况始终å˜åœ¨ï¼Œè€Œä¸”并éžæ€»æ˜¯å事。
预留数æ®å’Œ"次日" æ•°æ®çš„效果之间的差异。åŒæ ·ï¼Œè¿™ç§æƒ…况始终å˜åœ¨ã€‚您应该调整æ£åˆ™åŒ–,以最大程度地æå‡æ¬¡æ—¥æ•°æ®çš„效果。ä¸è¿‡ï¼Œå¦‚果与预留数æ®ç›¸æ¯”,次日数æ®æ•ˆæžœä¸‹é™æ˜Žæ˜¾ï¼Œåˆ™å¯èƒ½è¡¨æ˜ŽæŸäº›ç‰¹å¾å…·æœ‰æ—¶æ•ˆæ€§ï¼Œè€Œä¸”å¯èƒ½ä¼šé™ä½Žæ¨¡åž‹çš„效果。
"次日" æ•°æ®å’Œå®žæ—¶æ•°æ®çš„效果之间的差异。如果您将模型应用于è®ç»ƒæ•°æ®ä¸çš„æŸä¸ªæ ·æœ¬ï¼Œå¹¶åœ¨åº”用时使用åŒä¸€æ ·æœ¬ï¼Œé‚£ä¹ˆæ‚¨å¾—到的结果应该完全相åŒï¼ˆè¯·å‚阅第5 æ¡è§„åˆ™ï¼‰ã€‚å› æ¤ï¼Œæ¤å¤„的差异很å¯èƒ½è¡¨ç¤ºå‡ºçŽ°äº†å·¥ç¨‹é”™è¯¯ã€‚
ML第三阶段:缓慢增长ã€ä¼˜åŒ–细化和å¤æ‚模型
第二阶段å³å°†ç»“æŸæ—¶ä¼šå‡ºçŽ°ä¸€äº›ä¿¡å·ã€‚首先,月增长开始å‡å¼±ã€‚æ‚¨å°†å¼€å§‹åœ¨æŒ‡æ ‡ä¹‹é—´åšå‡ºå–èˆï¼šåœ¨éƒ¨åˆ†è¯•éªŒä¸ï¼Œæ‚¨ä¼šçœ‹åˆ°ä¸€äº›æŒ‡æ ‡ä¸Šå‡äº†ï¼Œè€Œå¦ä¸€äº›æŒ‡æ ‡ä¸‹é™äº†ã€‚情况å˜å¾—有趣起æ¥ã€‚由于越æ¥è¶Šéš¾å®žçŽ°å¢žé•¿ï¼Œå› æ¤ML系统必须å˜å¾—æ›´åŠ å¤æ‚。注æ„:相比之å‰ä¸¤ä¸ªéƒ¨åˆ†ï¼Œæœ¬éƒ¨åˆ†ä¸ä¼šæœ‰è¾ƒå¤šçš„纯ç†è®ºæ€§è§„则。我们è§è¿‡è®¸å¤šå›¢é˜Ÿåœ¨ML的第一阶段和第二阶段éžå¸¸æ»¡æ„。但到了第三阶段åŽï¼Œä»–们必须找到自己的é“路。
第38 æ¡è§„åˆ™ï¼šå¦‚æžœç›®æ ‡ä¸å调,并æˆä¸ºé—®é¢˜ï¼Œå°±ä¸è¦åœ¨æ–°ç‰¹å¾ä¸Šæµªè´¹æ—¶é—´ã€‚
当您的衡é‡ç»“果稳定时,您的团队会开始关注当å‰MLç³»ç»Ÿçš„ç›®æ ‡èŒƒå›´ä¹‹å¤–çš„é—®é¢˜ã€‚å¦‚å‰æ‰€è¿°ï¼Œå¦‚æžœçŽ°æœ‰ç®—æ³•ç›®æ ‡æœªæ¶µç›–äº§å“ç›®æ ‡ï¼Œåˆ™æ‚¨éœ€è¦ä¿®æ”¹ç®—æ³•ç›®æ ‡æˆ–äº§å“ç›®æ ‡ã€‚ä¾‹å¦‚ï¼Œæ‚¨å¯ä»¥ä¼˜åŒ–点击次数ã€+1次数或下载次数,但让å‘布决ç–部分ä¾èµ–于人工评分者。
第39 æ¡è§„则:å‘布决ç–代表的是长期产å“ç›®æ ‡ã€‚
Alice 有一个关于å‡å°‘预测安装次数的逻辑æŸå¤±çš„æƒ³æ³•ã€‚å¥¹æ·»åŠ äº†ä¸€ä¸ªç‰¹å¾ã€‚逻辑æŸå¤±é™ä½Žäº†ã€‚当她è¿è¡Œåœ¨çº¿å®žéªŒæ—¶ï¼Œçœ‹åˆ°å®‰è£…çŽ‡å¢žåŠ äº†ã€‚ä½†æ˜¯ï¼Œåœ¨å‘布评审会上,有人指出,æ¯æ—¥æ´»è·ƒç”¨æˆ·æ•°å‡å°‘了5%。于是,团队决定ä¸å‘布该模型。Alice 很失望,但现在她æ„识到å‘布决ç–å–决于多个æ¡ä»¶ï¼Œåªæœ‰ä¸€éƒ¨åˆ†æ¡ä»¶å¯ä»¥é€šè¿‡ML直接得到优化。
事实上,现实世界并ä¸æ˜¯ç½‘游世界:没有"生命值" æ¥ç¡®å®šäº§å“çš„è¿è¡ŒçŠ¶å†µã€‚团队必须使用自己收集的统计信æ¯æ¥å°è¯•æœ‰æ•ˆåœ°é¢„测系统未æ¥çš„表现会如何。他们需è¦å…³æ³¨äº’动度ã€æ—¥æ´»è·ƒç”¨æˆ·æ•°(DAU)ã€30 æ—¥DAUã€æ”¶å…¥ä»¥åŠå¹¿å‘Šä¸»çš„投资回报率。这些å¯åœ¨A/B 测试ä¸è¡¡é‡çš„æŒ‡æ ‡æœ¬èº«ä»…ä»£è¡¨äº†ä»¥ä¸‹æ›´é•¿æœŸç›®æ ‡ï¼šè®©ç”¨æˆ·æ»¡æ„ã€å¢žåŠ 用户数é‡ã€è®©åˆä½œä¼™ä¼´æ»¡æ„以åŠå®žçŽ°ç›ˆåˆ©ï¼Œè¿›ä¸€æ¥ï¼Œæ‚¨è¿˜å¯ä»¥è®¤ä¸ºå®ƒä»¬ä»£è¡¨äº†å‘布优质且实用的产å“,以åŠäº”å¹´åŽå…¬å¸ç¹è£å‘展。
唯一å¯ä»¥è½»æ¾åšå‡ºå‘布决ç–çš„æƒ…å†µæ˜¯ï¼Œæ‰€æœ‰æŒ‡æ ‡éƒ½åœ¨å˜å¥½ï¼ˆæˆ–至少没有å˜å·®ï¼‰ã€‚ 如果团队能够在å¤æ‚çš„ML算法和简å•çš„å¯å‘å¼ç®—法之间åšå‡ºé€‰æ‹©ï¼Œè€Œå¯¹æ‰€æœ‰è¿™äº›æŒ‡æ ‡æ¥è¯´ï¼Œç®€å•çš„å¯å‘å¼ç®—法å¯ä»¥æ供更好的效果,那么应该选择å¯å‘å¼ç®—法。æ¤å¤–,并未对所有å¯èƒ½çš„æŒ‡æ ‡å€¼è¿›è¡Œæ˜Žç¡®æŽ’å。具体而言,请考虑以下两ç§æƒ…形:
A | 100 万 | 400 万美元 |
B | 200 万 | 200 万美元 |
如果当å‰ç³»ç»Ÿæ˜¯A,那么团队ä¸å¤ªå¯èƒ½ä¼šæ”¹ç”¨B。如果当å‰ç³»ç»Ÿæ˜¯B,那么团队ä¸å¤ªå¯èƒ½ä¼šæ”¹ç”¨A。这似乎与ç†æ€§è¡Œä¸ºèƒŒé“è€Œé©°ï¼›ä½†æ˜¯ï¼Œå¯¹æ›´æ”¹æŒ‡æ ‡çš„é¢„æµ‹å¯èƒ½ä¼šæˆåŠŸä¹Ÿå¯èƒ½ä¸ä¼šï¼Œå› æ¤è¿™ä¸¤ç§æ”¹å˜éƒ½è•´å«ç€å·¨å¤§çš„风险。æ¯ä¸ªæŒ‡æ ‡éƒ½æ¶µç›–了团队所担心的一些风险。
æ¤å¤–ï¼Œæ²¡æœ‰ä¸€ä¸ªæŒ‡æ ‡æ¶µç›–å›¢é˜Ÿæœ€å…³å¿ƒçš„é—®é¢˜ï¼Œå³"五年åŽæˆ‘的产å“将何去何从"?
å¦ä¸€æ–¹é¢ï¼Œä¸ªäººæ›´å€¾å‘于选择å¯ä»¥ç›´æŽ¥ä¼˜åŒ–çš„ç›®æ ‡ã€‚ 大多数ML工具也都é’çè¿™æ ·çš„çŽ¯å¢ƒã€‚åœ¨è¿™æ ·çš„çŽ¯å¢ƒä¸‹ï¼Œå¿«é€Ÿåˆ›å»ºæ–°ç‰¹å¾çš„工程师能稳定地进行一系列å‘布。一ç§ç§°ä¸º"å¤šç›®æ ‡å¦ä¹ " çš„ML已开始解决æ¤é—®é¢˜ã€‚例如,您å¯ä»¥æ出约æŸæ»¡è¶³é—®é¢˜ï¼Œå¯¹æ¯ä¸ªæŒ‡æ ‡è®¾å®šä¸‹é™ï¼Œå¹¶ä¼˜åŒ–æŒ‡æ ‡çš„ä¸€äº›çº¿æ€§ç»„åˆã€‚ä¸è¿‡ï¼Œå³ä½¿å¦‚æ¤ï¼Œä¹Ÿå¹¶ä¸æ˜¯æ‰€æœ‰æŒ‡æ ‡éƒ½å¯ä»¥è½»æ¾æ¡†å®šä¸ºMLç›®æ ‡ï¼šå¦‚æžœç”¨æˆ·ç‚¹å‡»äº†æ–‡æ¡£æˆ–å®‰è£…äº†åº”ç”¨ï¼Œé‚£æ˜¯å› ä¸ºç›¸åº”å†…å®¹å±•ç¤ºå‡ºæ¥äº†ã€‚但è¦å¼„清楚用户为什么访问您的网站就难得多。如何预测整个网站未æ¥çš„æˆåŠŸçŠ¶å†µå±žäºŽAI完备问题:与计算机视觉或自然è¯è¨€å¤„ç†ä¸€æ ·éš¾ã€‚
第40 æ¡è§„则:ä¿è¯é›†æˆå¦ä¹ 简å•åŒ–。
采用原始特å¾å¹¶ç›´æŽ¥å¯¹å†…容进行排å的统一模型是最易于进行调试和ç†è§£çš„模型。但是,集æˆå¦ä¹ 模型(将其他模型的分数结åˆåˆ°ä¸€èµ·çš„模型)å¯ä»¥å®žçŽ°æ›´å¥½çš„效果。为了简å•èµ·è§ï¼Œæ¯ä¸ªæ¨¡åž‹åº”该è¦ä¹ˆæ˜¯ä»…接å—其他模型的输入的集æˆå¦ä¹ 模型,è¦ä¹ˆæ˜¯æŽ¥å—多个特å¾çš„基本模型,但ä¸èƒ½ä¸¤è€…皆是。 如果在å•ç‹¬è®ç»ƒçš„模型之上还有其他模型,则组åˆå®ƒä»¬ä¼šå¯¼è‡´ä¸è‰¯è¡Œä¸ºã€‚
使用简å•çš„模型进行集æˆå¦ä¹ (仅将"基本" 模型的输出作为输入)。æ¤å¤–,您还需è¦å°†å±žæ€§å¼ºåŠ 到这些集æˆå¦ä¹ 模型上。例如,基本模型生æˆçš„分数的å‡é«˜ä¸åº”使集æˆå¦ä¹ 模型的分数有所é™ä½Žã€‚å¦å¤–ï¼Œå¦‚æžœä¼ å…¥çš„æ¨¡åž‹åœ¨è¯ä¹‰ä¸Šå¯è§£é‡Šï¼ˆä¾‹å¦‚,ç»è¿‡æ ¡å‡†ï¼‰ï¼Œåˆ™æœ€ç†æƒ³ï¼Œå› ä¸ºè¿™æ ·ä¸€æ¥ï¼Œå³ä½¿åŸºæœ¬æ¨¡åž‹å‘生改å˜ï¼Œä¹Ÿä¸ä¼šæ‰°ä¹±é›†æˆå¦ä¹ 模型。å¦å¤–,强制è¦æ±‚:如果基本分类器的预测概率增大,ä¸ä¼šä½¿é›†æˆå¦ä¹ 模型的预测概率é™ä½Žã€‚
第41 æ¡è§„则:效果达到平稳åŽï¼Œå¯»æ‰¾ä¸ŽçŽ°æœ‰ä¿¡å·æœ‰è´¨çš„差别的新信æ¯æºå¹¶æ·»åŠ è¿›æ¥ï¼Œè€Œä¸æ˜¯ä¼˜åŒ–现有信å·ã€‚
æ‚¨æ·»åŠ äº†ä¸€äº›æœ‰å…³ç”¨æˆ·çš„å—众特å¾ä¿¡æ¯ï¼Œä¹Ÿæ·»åŠ 了一些有关文档ä¸å—è¯çš„ä¿¡æ¯ã€‚您探索了模æ¿ï¼Œå¹¶è°ƒæ•´äº†æ£åˆ™åŒ–ã€‚ä½†åœ¨å‡ ä¸ªå£åº¦çš„å‘布ä¸ï¼Œå…³é”®æŒ‡æ ‡çš„æå‡å¹…度从æ¥æ²¡æœ‰è¶…过1%。现在该怎么办?
是时候开始为截然ä¸åŒçš„特å¾ï¼ˆä¾‹å¦‚,用户在过去一天内ã€ä¸€å‘¨å†…或一年内访问的文档的历å²è®°å½•ï¼Œæˆ–者其他属性的数æ®ï¼‰æž„建基础架构了。您å¯ä»¥ä½¿ç”¨ç»´åŸºæ•°æ®æ¡ç›®æˆ–å…¬å¸å†…部信æ¯ï¼ˆä¾‹å¦‚,Google 的知识图谱)。利用深度å¦ä¹ 。开始调整您对投资回报的预期,并付出相应的努力。与在任何工程项目ä¸ä¸€æ ·ï¼Œæ‚¨å¿…é¡»å¯¹æ·»åŠ æ–°ç‰¹å¾çš„å¥½å¤„ä¸Žå¢žåŠ å¤æ‚性的æˆæœ¬è¿›è¡Œä¸€ç•ªæƒè¡¡ã€‚
第42 æ¡è§„则:ä¸è¦æœŸæœ›å¤šæ ·æ€§ã€ä¸ªæ€§åŒ–或相关性与çƒé—¨ç¨‹åº¦ä¹‹é—´çš„è”ç³»æœ‰æ‚¨è®¤ä¸ºçš„é‚£æ ·å¯†åˆ‡ã€‚
一组内容ä¸çš„å¤šæ ·æ€§å¯ä»¥æœ‰å¤šç§å«ä¹‰ï¼Œå…¶ä¸å†…容æ¥æºçš„å¤šæ ·æ€§æ˜¯æœ€å¸¸è§çš„一ç§ã€‚个性化æ„味ç€æ¯ä¸ªç”¨æˆ·èŽ·å¾—è´´åˆå…¶ä¸ªäººéœ€æ±‚的结果。相关性æ„味ç€æŸä¸ªç‰¹å®šæŸ¥è¯¢çš„结果更适åˆè¯¥æŸ¥è¯¢ï¼Œè€Œéžå…¶ä»–ä»»ä½•æŸ¥è¯¢ã€‚å› æ¤ï¼Œè¿™ä¸‰ä¸ªå±žæ€§å‡å…·æœ‰ä¸åŒäºŽå¸¸æ€çš„定义。
但常æ€å¾€å¾€å¾ˆéš¾è¢«æ‰“败。
请注æ„,如果您的系统在测é‡ç‚¹å‡»æ¬¡æ•°ã€è®¿é—®æ—¶é—´ã€è§‚看次数ã€+1 次数ã€è½¬å‘次数ç‰æ•°æ®ï¼Œé‚£ä¹ˆæ‚¨æµ‹é‡çš„是内容的çƒé—¨ç¨‹åº¦ã€‚团队有时会å°è¯•å¦ä¹ å…·å¤‡å¤šæ ·æ€§çš„ä¸ªæ€§åŒ–æ¨¡åž‹ã€‚ä¸ºå®žçŽ°ä¸ªæ€§åŒ–ï¼Œä»–ä»¬ä¼šæ·»åŠ æ”¯æŒç³»ç»Ÿè¿›è¡Œä¸ªæ€§åŒ–(代表用户兴趣的部分特å¾ï¼‰æˆ–å¤šæ ·åŒ–ï¼ˆè¡¨æ˜Žç›¸åº”æ–‡æ¡£æ˜¯å¦ä¸Žå…¶ä»–返回的文档有任何相åŒç‰¹å¾çš„特å¾ï¼Œä¾‹å¦‚作者或内容)的特å¾ï¼Œç„¶åŽå‘现这些特å¾çš„æƒé‡æ¯”预期低(或者有时是ä¸åŒçš„ä¿¡å·ï¼‰ã€‚
这并ä¸æ„味ç€å¤šæ ·æ€§ã€ä¸ªæ€§åŒ–或相关性ä¸é‡è¦ã€‚æ£å¦‚上一æ¡è§„则ä¸æ‰€æŒ‡å‡ºçš„é‚£æ ·ï¼Œæ‚¨å¯ä»¥è¿›è¡ŒåŽæœŸå¤„ç†æ¥å¢žåŠ å¤šæ ·æ€§æˆ–ç›¸å…³æ€§ã€‚å¦‚æžœæ‚¨çœ‹åˆ°æ›´é•¿æœŸçš„ç›®æ ‡æœ‰æ‰€å¢žé•¿ï¼Œæ‚¨å¯ä»¥å£°æ˜Žé™¤äº†çƒé—¨ç¨‹åº¦å¤–ï¼Œå¤šæ ·æ€§/相关性也很有价值。然åŽï¼Œæ‚¨å¯ä»¥ç»§ç»é‡‡ç”¨åŽæœŸå¤„ç†æ–¹æ³•ï¼Œä¹Ÿå¯ä»¥æ ¹æ®å¤šæ ·æ€§æˆ–ç›¸å…³æ€§ç›´æŽ¥ä¿®æ”¹ç›®æ ‡ã€‚
第43 æ¡è§„则:在ä¸åŒçš„产å“ä¸ï¼Œæ‚¨çš„好å‹åŸºæœ¬ä¿æŒä¸å˜ï¼Œä½†æ‚¨çš„兴趣并éžå¦‚æ¤ã€‚
Google 的团队通过以下åšæ³•å–得了大é‡è¿›å±•ï¼šé‡‡ç”¨ä¸€ä¸ªé¢„测产å“ä¸æŸç§è”系的紧密程度的模型,并使用该模型对其他产å“进行准确预测。您的好å‹ä¿æŒä¸å˜ã€‚å¦ä¸€æ–¹é¢ï¼Œæˆ‘曾è§è¿‡å‡ 个团队在应对多个产å“间的个性化特å¾æ—¶æ‰è¥Ÿè§è‚˜ã€‚是的,当时看起æ¥åº”该å¯ä»¥å¥æ•ˆçš„。但现在看æ¥å¹¶æ²¡æœ‰ã€‚有时å¯ä»¥å¥æ•ˆçš„方法是,使用一个属性的原始数æ®æ¥é¢„测å¦ä¸€ä¸ªå±žæ€§çš„行为。æ¤å¤–,请注æ„,仅仅是知é“用户有其他属性的历å²è®°å½•ä¹Ÿä¼šæœ‰å¸®åŠ©ã€‚例如,两个产å“上出现了用户活动或许本身就å¯ä»¥è¯´æ˜Žè¯¥é—®é¢˜ã€‚
Battery holders are key components in many types of portable and anchored electrical items - and you`ll find just as many varieties, brands, sizes and types of battery holder available across global markets as there are different sorts of battery.
In this guide, we`ll look more closely at some of the various mounting styles a battery holder might come fitted with, as well as the many different cell sizes they`re designed to work with.
What is a battery holder?
Battery Holders & Mounts
A battery holder is most commonly sold as an integral or removable compartment or cavity, designed to be inserted into - or attached onto - a suitable item of cell-powered equipment.
The primary function of a battery holder is to keep cells fixed in place safely and securely while conveying power from the batteries to the device in question. External connections on battery holders are most often made by contacts either with pins, surface mount feet, soldered lugs or via a set of wire leads.
Battery holders are often designed to be incorporated within the body of an electrical item, but they`re also frequently sold as external compartments or attachments. Either way, some of the most important factors to consider when choosing an appropriate battery holder for a given application will be:
Battery size
Compatible cell types
Contact/terminal style
Battery mounting and insertion method
Unit size and shape
Battery holder cover and housing design
Build quality and manufacturer reputation
Holders and mounts for different battery sizes
Many different battery holder sizes and mounting types are available from manufacturers and suppliers all over the world. Most will be made to work with a particular sort of cell - or sometimes with a range of multiple battery sizes - and to be affixed to the electrical item in question in a specific way.
Bear in mind that, in addition to sorting through the wide variety of battery holder sizes and capacities on the market, you`ll also need to be mindful of the many options available in terms of actual battery chemistry.
While the vast majority of holders and mounts will be able to accommodate most battery chemistry types, each will have its own unique set of pros and cons in terms of performance, power drain, longevity, cost and environmental impact.
Common battery types sold by many suppliers might include:
Alkaline batteries
Lithium batteries
Rechargeable batteries (including li-ion)
Nickel cadmium batteries
Nickel metal hydride batteries
NiZN batteries
The above range of cell types is often available across a broad range of popular battery sizes and shapes, including both AA and AAA versions. Most, but not all, will be interchangeable in a wide range of devices, holders and mounts. However, it`s seldom advisable to combine mixed battery chemistries in a single electronic device.
Doing so can often impede device performance, as the item drawing power will only ever perform to the capabilities of the weakest battery in the holder. Of even greater concern is the fact that more powerful batteries can sometimes force weaker ones to work harder than they`re intended to, causing them to drain to very low voltage levels and possibly leading to potential leaking or rupture.
As such, it`s always wise to ensure you know which cell types you`re likely to be using in advance of making a purchase, as well as the correct sizing you`ll require.
AA Battery Holders & Mounts
An AA size battery holder is one of the more common types of battery storage compartments you`ll see on sale from UK suppliers. AA size (`double-A`), is what most people would typically think of as a `standard` battery, and remains extremely common in all kinds of portable electrical items and equipment.
AA battery pack holders can be manufactured to house any number of cells in theory, but in daily use it`s usually anywhere between 1-10 batteries, with 2-8 being by far the most popular options. AA single battery holders are also common.
Relatively few cell-powered tools or other pieces of electrical equipment will be designed to require more than 8 AA batteries for achieving sufficient power supply. Items that demand a higher power draw than this will usually be built around a larger cell type, for improved practicality and less frequent need for battery replacement or recharging.
AAA Battery Holders and Mounts
Battery holders for AAA cells, commonly referred to as `triple-A` are another very common configuration. AAA batteries are smaller and slimmer than AA versions, with around a third of the storage capacity.
Due to the increasing efficiency of many device types, AAA batteries are often found in more compact pieces of equipment that would typically have been expected to run off double-A cells in years gone by. Today, these will very often include low-drain portable or handheld electronic goods or tools, such as:
Remote controls
Mp3 players, handheld gaming systems and other portable entertainment devices
Digital cameras
Cordless mice or keyboards
Flashlights and other portable lighting units
Clocks and alarms
Kitchen gadgets
AAA battery holders are generally built to accommodate fewer individual cells than bigger units for housing AA and larger capacity batteries, as devices requiring more power will typically not run off triple-A cells.
An AAAA battery holder is a somewhat less common configuration, given that the much smaller than average AAAA (`quadruple-A` or `four-A`) cell standard drains considerably faster than AA or AAA equivalents.
C Battery Holders and Mounts
C size battery holders are designed around the fairly common C battery (sometimes known as an R14). This is a fairly wide diameter cylindrical cell, smaller than D batteries while being the same height - but significantly wider - than AA standard.
C cell batteries, holders and mounts are less widely used than either AA or D versions, particularly since power efficiency is continually improving across most cell-powered devices. However, they can still often be found on a range of medium-drain electronic items, including:
Heavier flashlights and smaller ambient lighting fixtures (e.g. table lamps)
Radios and portable speakers
Toys and games Musical instruments and amplifiers
D Battery Holders and Mounts
D batteries are among the larger types of `everyday use` cells you`ll commonly encounter around the home or workplace. D size batteries tend to be used for higher-drain devices that tend to be used for longer periods at a time, including:
High Power torches and flashlights
Safety systems
Geiger counters
Megaphones, larger musical instruments and more powerful audio devices or speakers
Motorised toys and portable gaming consoles
Products with inbuilt electric motors
Battery holders for D cells are most commonly sold in configurations designed to house either 2 or 4 individual batteries.
12V Battery Holders and Mounts
Battery holders for 12V cells are somewhat less widely available, chiefly because the 12V standard incorporates such a wide range of battery shapes and sizes, although 12V Battery Contacts themselves are more commonly sold as separate components (see Battery holder mounts, below).
The precise sort of 12V battery you might need for specific applications will dictate the appropriate form factor, which can be anything from a large heavy car battery type to one closely resembling a traditional AA cell.
With larger 12V batteries, in particular, it`s less common to need more than one at a time, and they`re generally built more intrinsically into the device or equipment being powered than other types of `snap in/out` batteries designed for frequent and quick replacement. This means that 12V battery holders and mounts aren`t as frequently sold by many UK suppliers as versions for other, more common cell types.
Battery holder mounts
Battery holder mounts usually refer to the way the battery holder itself is slotted, secured or attached onto or into the main body of the item being powered. Note, however, that the term `battery mount` is also used to refer to the method by which individual cells are held in place within the overall holder unit.
Battery holder mounts and battery mounts are usually present as an integral part of a complete holder unit, but you can also buy battery mounts cheaply as swap-out or replacement parts. They can come in many different configurations and designs, with some of the more common types being:
Chassis mount
Panel mount
PCB mount
Slide-in mount (usually refers to the battery insertion method)
Snap-in mount (usually refers to the battery insertion method)
Strap & lead mount (a battery connector, rather than a holder per se)
Surface mount
Through-hole mount
Wire lead mount (refers to connection type)
In this section, we`ll look at some of the most widely used battery mount types in everyday use
Chassis Mount Battery Holders
A chassis mount battery holder denotes one intended to be affixed to a surface plate within the main body of the device being powered, and will usually remain in place while batteries are being swapped. A chassis mount holder is usually attached either via fixing holes in the back of the casing - i.e. a through-hole mount - or via tabs located around the sides of the battery holder.
Chassis mount battery holders can be either metal or plastic, open faced or enclosed (i.e. with or without a cover), and are commonly designed to accept a wide range of cell sizes including AA, AAA, C, D and 9V PP3
Panel Mount Battery Holders
A panel mount battery holder is designed to be slotted into a pre-existing cavity in the side, top or back exterior panel of a device, and removed entirely as a single self-contained unit when it`s time to swap out the batteries. They`re often manufactured such that the cover of the battery holder will sit flush with the device`s panel once inserted
Panel mount battery holders are most commonly secured in place via one of two methods - either via a screw-in system for optimal stability or with a flange for easier removal and reinsertion.
PCB Mount Battery Holders
As the name implies, a PCB mount battery holder is intended for applications where the battery holder needs to be installed directly onto a printed circuit board
The most common design is for the PCB mount holder to include several small sharp pins around its perimeter - these are pushed directly through the surface of the PCB and then soldered on from the rear side
Holders for PCB mounted batteries can be made from metal or plastic. While they`re frequently built to accommodate single coin cells, such as you might find on a typical computer motherboard, they can also be configured to allow for multiple cells and other battery types, including cylindrical AAA, AA, C, or D cells
Slide In Mount Battery Holders
As noted above, a slide-in mount battery holder will generally refer to the way the individual cells are inserted into and removed from the battery holder as a whole.
Slide-in mounts are arguably the most common type of battery connection/terminal setup, requiring the user simply to `drop` the cells into place either lengthways or side-on, at which point they`re either held in place by opposing forces from the two terminals, or secured with a cover or cap
Slide-in battery mounts tend to be among the cheapest and most cost-effective choices to buy in the UK for quickly and simply inserting a cell into an appropriate holder. They work reliably well, and are most often seen in molded plastic or metal battery holders incorporating self-contained battery compartments.
Snap-In Mount Battery Holders
A snap-in mount battery holder (sometimes called snap-on, depending on the manufacturer or supplier) denotes a terminal/contact setup where the cell is held more securely in place.
This is usually achieved using tensioned metal clips, sometimes incorporating springs, through which the battery needs to be pushed with a small amount of force before it clicks into place ready for use. Snap-in battery mounts give the device greater protection against accidental cell power loss during movement, vibration and shock
The term `snap-in` or `snap-on` is also widely used to refer to the standard contact types used on 9V cells (see Strap and lead battery holders, below).
Strap And Lead Battery Holders
Strap and lead battery holders aren`t actually holders per se, but rather a method of directly attaching a freestanding cell to an electronic device or motor without the use of an entire housing unit.
They generally consist of a wire, which is designed to be hooked up to the circuitry of the device via tinned ends, combined with a flat plastic terminal pad that attaches to the contacts of a battery via two press studs
The strap and lead battery holder type is most often used with 9V PP3 cells, allowing the cell to be lifted clear away from the main body of the item for removal, replacement or reattachment.
Surface Mount Battery Holders
A surface mount battery holder is designed to sit flat on any internal or external panel or plate, either around or inside the device being powered, and is generally open-faced.
Holders of this type are most often seen as a mounting solution for coin-type batteries with button terminal contacts, in applications where the cell doesn`t need to be connected directly to a PCB.
Types of battery holders
Once the correct size and mount style of battery holder has been chosen for your intended application and device, it`s also important to be aware of the different physical designs available for various battery holder types. In the section below, we`ll outline some of the more common versions.
Clip battery holders
A battery holder clip is a simple, economical choice for mounting batteries securely within a holder. They tend to consist of two metal arms, arranged in a loop with a small gap at the apex, through which the cells are pushed and subsequently held in place. Clip battery holders tend to be most often associated with common cylindrical batteries such as AA and AAA, but are also sold for securing C, D and 9V PP3 cells into position.
Contact battery holders
Contact battery holders refer to the simple tension-type terminal contact plates and accessories (including coil springs, button, top spring arm and leaf spring contact types) that hold a battery in place while allowing it to deliver power to the device drawing from it.
Again, these are cheap and cost-effective components widely available as separate swap-in or replacement parts that can be used to fit a very wide range of different cell sizes and battery holder types.
Through Hole Surface Mount Battery Holders
A through-hole surface mount battery holder is a common variation on the type outlined above, except it`s affixed to the surface of the device in question via a hole in the housing rather than via tabs around the perimeter. This makes the through-hole version better suited to applications where space is at a premium.
Wire Lead Mount Battery Holders
Wire lead mount battery holders denote a particular type of external connection between the battery holder itself and the circuitry of the device being powered. A wire lead mount battery holder will be supplied with two leads attached to the holder at one end, with the other ends free to be hooked into the circuitry of the device as the holder is attached.
This can offer greater flexibility in terms of battery holder mounting locations than certain pin or foot connection types, which must make immediate contact with the circuit wherever the holder is positioned.
Battery Holders
ATKCONN ELECTRONICS CO., LTD , https://www.atkconn.com