frameworks_base

Author	SHA1	Message	Date
Yangster-mac	9def8e3995	Reduce statsd log data size. 1. Hash the strings in metric dimensions. 2. Optimize the timestamp encoding in bucket. Use bucket num for full bucket and millis for partial bucket. 3. Encode the dimension path per metric and avoid deduping it across dimensons. Test: statsd test Change-Id: I18f69654de85edb21a9c835c73edead756295e05 BUG: b/77813755	2018-04-26 04:30:18 -07:00
Chenjie Yu	e36018b272	add dump report reason to reports + also change uidmapping version numbers to int64_t Bug: 78132855 Change-Id: Iac7ea93e4bf651bd65bd03383e7ab4971af4fc29 Fix: 78132855 Test: gts test	2018-04-18 20:19:21 +00:00
Yao Chen	163d2602db	Handle logd reconnect. When statsd reconnects to logd, statsd will read all logs from buffer again. To prevent us from reprocessing old events, we do the following: 1. At any given moment, record the largest timestamp(T_max) and last timestamp (check point) that we've seen before. 2. When reconnection happens, we look for the check point until we see a new log with a timestamp larger than T_max. -> If we found the CP, resume after the CP. Success -> If we can't find CP, there is definitely log loss. We reset all configs. Note: 1. Logd has an API to read logs after a certain timestamp. But this api is vulnerable to time changes from Settings. So we cannot rely on it. 2. If logd inserts a new log (with older timestamp) before CP, we cannot detect it. It's not possible to detect it without record all timestamps we have seen. Test: statsd_test Bug: 77813113 Change-Id: Ic3fdb47230807606ab11dc994cb162194adb8448	2018-04-10 22:06:03 -07:00
Yangster-mac	e68f3a5811	Flush the partial bucket when startd shuts down or config updated. Test: statsd test BUG: b/77556036 Change-Id: Ie4a04ace55e07c4529cdff5906ba874f8815f620	2018-04-05 18:05:57 -07:00
David Chen	bd12527c90	Fix uid map to be simpler and fix partial bucket. The previous scheme captured periodic snapshots for each config with complex logic that's unnecessary and wasted memory. We actually don't need to store any snapshots since we just convert the current state into a snapshot and also include the deltas (change events) since the previous report until now. To make the system more robust, we also include up to 100 of the deleted apps in the uid map. Also, fix the wiring of the partial buckets so the metric producers form partial buckets on both app upgrade and removal, but not on installation of a new app. Also, we update StatsCompanionService to also include disabled apps. Bug: 77607583 Test: Verified unit-tests pass and added new e2e tests. Change-Id: I98e1f544d6e6571545ae1581c4cebab807596f51	2018-04-05 16:15:01 -07:00
Yangster-mac	b142cc8add	Statsd config TTL Roughly check the config every hour to see whether the ttl expired. If so, read the config from disk and recreate the metric manager. Test: statsd test BUG: b/77274363 Change-Id: I16838afe5bbe966c3a0f638869751f9b59a5a259	2018-04-04 15:59:43 +00:00
David Chen	faa1af535b	Includes annotations with statsd reports. It's tricky to determine the source of the metrics on a device currently since we can take the union of multiple configs and send only one giant statsd_config into statsd. We will use the int64 field to track the sub config id's and the int32 field to track the version for each sub config, but the fields are named more generically as annotations. The annotations are available in both the reports and metadata. Test: Check that all unit-tests pass on marlin-eng Bug: 77327261 Change-Id: Ic37c549c8b2991676f69948c515156765c9f5108	2018-04-03 18:20:40 -07:00
Yangster-mac	c04feba805	Move forward the alarm timestamp when config is added to statsd. Test: statsd test BUG: b/77344187 Change-Id: Ieacffaa29422829b8956f2b3fcb2c647c8c3eed9	2018-04-02 18:12:36 -07:00
David Chen	35045cbc34	Fix uidmap in statsd. Previously tried an optimization that results in corrupted proto output. This changes to a safer approach of storing the snapshot data in memory and only converting to proto output when the ProtoOutputStream is provided. Also fixes a security issue when trying to invoke triggerUidSnapshot since we forgot to use SCS' permissions. Test: Added a unit-test to verify output of StatsLogProcessor. Bug: 76231867 Change-Id: Id410ce3505fda9d71caa71942ef3068b55872c66	2018-03-24 15:01:04 -07:00
Yao Chen	06dba5d79c	Add API to let metrics directly drop data without writing to an output. + Metrics will do flushIfNeeded() to correctly move the clock and informing AnomalyTracker the past bucket info, and then clear past buckets. + We will still keep the current bucket data for the validity of the future metrics. Bug: 70571383 Test: statsd_test Change-Id: Ib13c45574974e7b4e82bd8f305091dc93bda76f5	2018-03-01 15:22:55 -08:00
Yangster-mac	932ececa16	Alarm: wakes up statsd and notifies the subscribers. Test: manually tested it. Change-Id: Id796a68976aeb1611183023ba4e9c6a8b8c44bb8	2018-02-27 13:30:48 -08:00
Yao Chen	8a8d16ceea	Statsd CPU optimization. The key change is to revamp how we parse/store/match a log event, especially how we match repeated field and attribution nodes, and how we construct dimensions and compare them. + We use a integer to encode the field of a log element. And also encode the FieldMatcher into an integer and a bit mask. The log matching becomes 2 integer operations. + Dimension is stored as encoded field and value pair. Checking if 2 dimensions are equal is then becoming checking if the underlying integers are equal. The integers are stored contiguously in memory, so it's much faster than previous tree structure. Start review from FieldValue.h Test: statsd_test + new unit tests Bug: 72659059 Change-Id: Iec8daeacdd3f39ab297c10ab9cd7b710a9c42e86	2018-02-12 10:38:45 -08:00
Yangster-mac	b0d0628a29	Thread-safety at log processor level. Test: statsd unit test passed. Change-Id: Ibe8c8d3cc8297875b16ee385c077b71c87353147	2018-01-08 14:59:42 -08:00
Yangster-mac	94e197cceb	1/ Change all "name" to id in statsD. 2/ Handle Subscription for alert. 3/ Support no_report_metric Bug: 69522276 Test: all statsd unit tests passed. Change-Id: I851b235f2d149b8602b0cad632d5bf541962f40a	2018-01-03 15:34:00 -08:00
Yangster-mac	2087716f2b	1/ Support nested message and repeated fields in statsd. 2/ Filter gauge fields by FieldMatcher. 3/ Wire up wakelock attribution chain. 4/ e2e test: wakelock duration metric with aggregated predicate dimensions. 5/ e2e test: count metric with multiple metric condition links for 2 predicates and 1 non-sliced predicate. Test: statsd unit test passed. Change-Id: I89db31cb068184a54e0a892fad710966d3127bc9	2018-01-01 10:01:36 -08:00
Yao Chen	d10f7b1c7b	Add log source filtering in statsd to filter out spams. + Add log source whitelist in StatsdConfig + Some changes in UidMap API. Listener needs to be wp instead of sp. + Update dogfood app config to have log source + Increase the stats service thread pool size to 10 (9+1). TODO: add unit tests(b/70805664). This unit test takes some time to write. Test: statsd_test & manual Change-Id: I129b1cc13db5114db7417580962bd7cc4438519d	2017-12-20 18:45:43 -08:00
Chenjie Yu	85ed838713	align metrics to 5min bundary We use one alarm clock for all pulled atoms. If metrics from different configs are not aligned, the clock will be set to repeat at higher and higher frequency, and consume a lot of battery. Current implementation assumes a 5min minimum bucket size. New metric start time is set to be aligned to the start time of statsd in the next 5min. So it will ignore events up to 5min. align puller alarm to minute bundary Test: unit test Change-Id: I77ffa3c13de363c780b1000181b9a9b780dd0846	2017-12-16 17:08:46 -08:00
Yao Chen	288c600013	Only create ProtoOutputStream when onGetData() is called. The exception is EventMetricProducer. Each EventMetricProducer will still have a ProtoOutputStream Because LogEvent comes as a fixed 4K, it's more memory efficient to have an 8k ProtoOutputStream for storing the events. Also removed finish() api in MetricProducer, which was intended to use with Dropbox. Test: statsd_test & dogfood app Bug: 70393808 Change-Id: I2efe4ecc76a88060a9aa5eb49d1fa6ea60bc5da8	2017-12-12 14:20:32 -08:00
David Chen	d9269e2ee7	Adds rate limit to checking byte size. Since there is a separate guardrail for memory used by uid map, we no longer add the memory from uid map with the memory per each config's metrics. We also prevent the byte size check from happening too frequently. In order to mock the MetricsManager, we refactor some of the existing methods. Test: Added unit-tests and verified they all pass on marlin. Change-Id: I15cf105f7d95f4016fdb0443b0a33eebe862cafb	2017-12-07 18:22:58 -08:00

19 Commits