A Fast Sanity Check for Clustering Answers
Why can clustering results vary a lot across random seeds even on the same dataset?
打开 →GLOBAL SEARCH
搜索在服务端完成,题目解析与答案不会进入搜索结果。登录后可搜索自己的收藏题单。
找到 28 个结果
中文题目Why can clustering results vary a lot across random seeds even on the same dataset?
打开 →Why can clustering raw price levels of stocks be misleading compared with clustering normalized returns or features?
打开 →A researcher generates 240 heavily correlated strategy variants but argues they amount to only 24 effectively independent families. If the desk still flags any family with p-value below 8%, what is the approximate probability of at least one false family-level winner under the nu
打开 →A point in clustering has average within-cluster distance a=2 and smallest average distance to another cluster b=5. What is its silhouette score?
打开 →For single-linkage hierarchical clustering on 1D points {0,1,4}, which pair merges first?
打开 →A 2D k-means run has centroids at (0,0) and (4,0). For point x=(1,2), which centroid is closer in the assignment step?
打开 →Cluster A has size 3 and mean 0, cluster B has size 1 and mean 2, cluster C has size 1 and mean 5. Under Ward linkage, which pair merges first: A-B or B-C?
打开 →In DBSCAN with minPts=4, point A has 4 neighbors, point B has 3 neighbors, and B lies within eps of A. What status does B get?
打开 →A 1D cluster currently contains points [1,2,7]. What centroid does k-means assign to that cluster in the update step?
打开 →A 2D cluster contains points (0,0), (2,4), and (4,2). What centroid does k-means assign to that cluster?
打开 →A 2-component GMM has priors 0.6 and 0.4. At point x, the component likelihoods are 0.1 and 0.3. What posterior responsibility does component 2 get?
打开 →A DBSCAN run uses minPts=5, counting the point itself. A point has 5 neighbors within eps. Is it a core point?
打开 →In 1D, clusters A={0,2} and B={5,9}. What is their average-linkage distance?
打开 →A 1D k-means step has current centroids 0 and 10, and the assigned points are {1,2} to the first cluster and {9} to the second. What are the updated centroids?
打开 →A cluster contains points [1,2,4]. What is its within-cluster SSE around the cluster mean?
打开 →Under Ward linkage, what merge cost increase do two singleton clusters at positions 1 and 4 have in 1D?
打开 →Clusters A={1,4} and B={6,8} are compared under complete linkage in 1D. What distance is used?
打开 →In a 2-component GMM with equal priors, component densities at x are 0.12 and 0.08. Which component gets higher responsibility on the first E-step?
打开 →Why can k-means behave poorly when one feature has a much larger scale than the others?
打开 →Why can silhouette score prefer a smaller k even when larger k lowers SSE?
打开 →You expect non-convex moon-shaped clusters plus some noise points. Would you try k-means or DBSCAN first, and why?
打开 →A DBSCAN point has 4 neighbors within eps when minPts=5, so it is not core. If eps is increased and the neighborhood count becomes 6, what status change occurs?
打开 →A 1D cluster has members [0,2,4], so its centroid is 2. If an outlier 10 is added to the cluster, by how much does the centroid shift?
打开 →A point has silhouette inputs a=3 and b=5. If b falls to 4 while a stays fixed, how much does the silhouette score decrease?
打开 →In a 2-component GMM with equal priors, component 1 has likelihood 0.2 at x while component 2 has likelihood 0.1, so responsibility_1 is 2/3. If component 2's likelihood rises to 0.2, what does responsibility_1 become?
打开 →The Euclidean distance between (0,0) and (3,40) is computed on raw features. If the second feature is standardized by dividing it by 10 while the first is unchanged, what new distance do you get?
打开 →Why can clustering grid points around the strike improve gamma estimates more than simply extending S_max farther out?
打开 →Why can square-root-of-time VaR scaling fail badly during volatility clustering?
打开 →