WO2018196650A1 - User feature data acquisition method and device, server, and medium - Google Patents
User feature data acquisition method and device, server, and medium Download PDFInfo
- Publication number
- WO2018196650A1 WO2018196650A1 PCT/CN2018/083298 CN2018083298W WO2018196650A1 WO 2018196650 A1 WO2018196650 A1 WO 2018196650A1 CN 2018083298 W CN2018083298 W CN 2018083298W WO 2018196650 A1 WO2018196650 A1 WO 2018196650A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- user
- access request
- data access
- database
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/25—Integrating or interfacing systems involving database management systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/178—Techniques for file synchronisation in file systems
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/18—File system types
- G06F16/182—Distributed file systems
Definitions
- the present application belongs to the field of Internet technologies, and in particular, to a method, an apparatus, a server, and a medium for acquiring user feature data.
- business systems For example, financial and insurance companies have various types of business systems, such as life insurance insurance systems, property insurance insurance systems, and customer personal information systems. Each business system has its own independent database, which stores the operational data of the enterprise in this business.
- the embodiment of the present application provides a method, an apparatus, a server, and a medium for acquiring user feature data, so as to solve the problem that in the prior art, when multi-service database is distributed, it is difficult to quickly acquire multi-dimensional user feature data. problem.
- a first aspect of the embodiment of the present application provides a method for acquiring user feature data, including:
- the calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained;
- a second aspect of the embodiments of the present application provides a device for acquiring user feature data, where the device for acquiring user feature data includes a module for performing a method for acquiring user feature data according to the above first aspect.
- a third aspect of an embodiment of the present application provides a server including a memory and a processor, the memory storing computer readable instructions executable on the processor, the processor executing the computer readable The steps of the method for acquiring user feature data as described in the first aspect are implemented when instructed.
- a fourth aspect of the embodiments of the present application provides a computer readable storage medium storing computer readable instructions, the computer readable instructions being executed by a processor to implement the first aspect as described in the first aspect The steps of the method of obtaining user characteristic data.
- the multi-service database in the case that the multi-service database is separated, by constructing a user attribute library that summarizes the basic data of each user, the feature data corresponding to each user can be quickly parsed based on the calculation engine, and the user feature data is realized. Comprehensive assessment. Moreover, since the analysis and aggregation process is performed in the user attribute library, it does not affect the operation of the existing business system. In addition, by receiving the data access request by using the interface, each service system can conveniently obtain the feature data of the user, and can provide accurate functions for personalized service customization and service recommendation according to different feature data corresponding to different users. data support. At the same time, since the business system needs to perform conversion processing on the user basic data in other service databases by itself, the efficiency of the user system for obtaining the user characteristic data is improved.
- FIG. 1 is a flowchart of an implementation of a method for acquiring user feature data provided by an embodiment of the present application
- FIG. 2 is a flowchart of an implementation of a method for acquiring user feature data according to another embodiment of the present application
- FIG. 3 is a specific implementation flowchart of a method S4 for acquiring user feature data according to an embodiment of the present application
- FIG. 4 is a flowchart of another specific implementation of the method S4 for acquiring user feature data provided by the embodiment of the present application;
- FIG. 5 is a schematic diagram of an application scenario applicable to a method for acquiring user feature data according to an embodiment of the present disclosure
- FIG. 6 is a structural block diagram of an apparatus for acquiring user feature data according to an embodiment of the present application.
- FIG. 7 is a schematic diagram of a server provided by an embodiment of the present application.
- the method for acquiring user feature data may be executed on a server on which a distributed file system is installed.
- the distributed file system can be a Hadoop distributed file system (HDFS file system), which can be adapted to run on common hardware (commodity) Hardware) provides high throughput data access.
- HDFS file system Hadoop distributed file system
- FIG. 1 is a flowchart showing an implementation process of a method for acquiring user feature data provided by an embodiment of the present application, which is described in detail as follows:
- S1 Synchronize user-based data in each business database to a distributed file system to build a user attribute library.
- the service database is used to manage and store data resources generated by the service system, and the data resources include user basic data.
- the user refers to the customer directly facing the business, that is, the user basic data is the business data or basic information related to the customer.
- the user-based data generated by the life insurance system is the personal basic data of the policy customer.
- User-based data generated by different business systems can be stored in different business databases.
- the HDFS file system of the user-based data constitutes a user attribute library, which is also a database.
- the synchronization method can meet the construction requirements of the user attribute library.
- the calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained.
- SPARK refers to Apache Spark, which is a fast and universal computing engine designed for large-scale data processing.
- the SPARK Compute Engine enables memory-distributed datasets to optimize iterative workloads in addition to providing interactive queries.
- the user attribute library uses the SPARK calculation engine and the ETL (Extraction, Transformation, Loading) tool to perform conversion and filtering analysis on the collected large amount of user basic data, and finally loads the processing result into the data warehouse according to the predefined data warehouse model.
- the processing result is the summarized user characteristic data.
- the user characteristic data stored in the user attribute database is summary information for the usage, usage habits, and attention service of the user in each business system, and is summary statistics for each user.
- a Hadoop distributed file system can be composed of multiple servers.
- the user base data is synchronized to the HDFS file system
- the user base data from different service databases can also be synchronized to different servers, and each server uses the SPARK calculation engine and the ETL tool to synchronize the users on the server.
- the basic data is analyzed and summarized, and then the analysis results of each server are summarized.
- the user attribute library uses multiple servers in parallel to implement analysis and summary of user basic data, the data processing efficiency of the system is improved.
- S3 Obtain a data access request sent by the service system based on the interface of the user attribute library.
- the interface is a shared boundary between the user attribute library and other external components for information exchange. It is developed by using the java servlet, and is composed of a preset expression program structure and data provided by a preset SQL language.
- the user's feature data is published externally by the interface of the user attribute library. If other business systems within the enterprise need to obtain or invoke the feature data of a certain user, the interface provided by the user attribute library is invoked by the post method to issue a data access request.
- the user attribute library interacts with the business system through the http protocol.
- step S1 specifically includes:
- the method for acquiring the user feature data further includes:
- the JOB is a function provided by the database to periodically execute a certain stored procedure or a package body.
- the data synchronization is implemented by the SQL JOB timing job at the database level.
- the implementation principle includes: creating a data synchronization task by using the JOB method, and in the execution condition corresponding to the data synchronization task, Set the task execution time and the task execution interval. When the current time reaches the preset task execution time, or when the difference between the current time and the task execution time is a multiple of the task execution interval, the HDFS file system is created to establish a database connection for each service.
- the database is connected.
- the SQL statement preset to the HDFS file system is executed, and according to the syntax analysis result of the SQL statement, if the table parameter corresponding to the SQL statement exists in the service database, the HDFS file system is preset from the service database by the identity authority.
- the user base data is read and the read user base data update is inserted into the HDFS file system.
- the HDFS file system creates a database connection that includes creating a dblink for the remote database. Based on the dblink method, the HDFS file system can quickly acquire the user base data of the remote service database as fast as accessing the local database, thereby improving synchronization efficiency.
- the data to be synchronized preset in the data synchronization task of the JOB mode is the user basic data generated by the service system within a preset time period
- the HDFS file system when the data synchronization task is executed, only puts the service system in the pre-predetermined time.
- the preset time period is from the first day of the month to the day before. That is, the HDFS file system only synchronizes the user-based data generated by the business system from the 1st of the month to the day.
- the user base data in the service database is still in the real-time update state because the day has not passed. Therefore, in order to prevent the HDFS file system from being synchronized, the user base data in the service database is updated again. As a result, the HDFS file system and the service database are not substantially synchronized.
- the HDFS file system performs the data synchronization task, only the user basic data generated by the business system before the day is synchronized.
- the effective time of the user's business is calculated on a monthly basis, and the 31st of each month is the expiration date, for example, the user is insured for 5 on October 2
- the monthly life insurance policy the March 31 of the second year is the expiration date of the policy.
- the HDFS file system only synchronizes monthly. User-based data generated by the business system from the 1st to the day.
- the user basic data is divided into a common level and an important level according to the user level.
- the user property library is pre-configured with two JOB tasks for setting the analysis summary action execution time of the user base data of the common level and the important level.
- the user attribute library performs an analysis and summary operation on the user-level data of the synchronized important level according to the rules preset by the JOB task, and performs the analysis summary operation on the weekly basis for the user-level data of the common level.
- the user attribute library performs an analysis summary operation on the user base data only during the non-access busy period.
- the business system usually concentrates on accessing the user characteristic data in the user attribute database during the time period from 09:00 to 18:00, and the JOB task analyzes and summarizes the user basic data that has been synchronized to the local.
- the task execution time can be set to 0:00 in the morning.
- the execution time of the analysis and summary action of the user attribute database is limited, so that when the service system issues a data access request, the user attribute database can respond in time, and does not occupy too much due to the analysis and summary operation.
- System resources improve the response efficiency of the user property library.
- S4 Verify the validity of the data access request, and if the data access request is legal, return the feature data of the user that matches the data access request to the service system.
- the user attribute database When the user attribute database receives the data access request based on the user feature data sent by any service system from the interface, it does not immediately return the user's feature data to the service system, but first determines whether the data access request is legal, including: Whether the source of the data access request satisfies the preset data acquisition authority and determines whether the data content of the data access request is abnormal. Only when the data access request is legal, the user property library queries and returns the user feature data requested by the business system. The user feature data requested by the service system is the feature data that matches the user number carried in the data access request. The user attribute library performs a query from the stored data according to the user number, and converts the feature data obtained by the query into data in an xml format and returns.
- FIG. 3 shows a specific implementation process of the foregoing step S4, which is described in detail as follows:
- S411 Determine whether the IP address carried by the data access request and the access key are both present in a configuration file of the user attribute database.
- the user attribute library parses the received data access request, and reads the first attribute value corresponding to the source IP field and the second attribute value corresponding to the access key field from the data report of the data access request, then the first An attribute value is an IP address carried by the data access request, and represents an IP address currently used by the service system that issues the data access request, and the second attribute value is an access key, which is input by the operator of the service system. Account key.
- the user property library loads the settings of the environment and the collection of files that it needs, which is the configuration file.
- the source IP address and the access key that are allowed to acquire the user feature data are pre-stored in the configuration file.
- the legal IP address saved in the configuration file includes the exact IP address, IP address segment, or IP address in the form of a wildcard. As long as the user attribute database detects that the IP address carried by the data access request is the same as the IP address in the configuration file or belongs to any IP address segment in the configuration file, it may be determined that the IP address carried by the data access request exists. In the configuration file of the user property library.
- IP address and the access key carried in the data access request are both in the configuration file of the user attribute database, acquire attribute information in the data access request, where the attribute information includes a data request parameter. And an encrypted visitor signature.
- the visitor signature is key data attached to the data access request or a transformation of the key data.
- the user attribute library decrypts the received visitor signature by using a pre-agreed decryption key, thereby determining whether the source of the data access request is a real authorized user, and confirming the message integrity of the data access request, thereby receiving the illegality
- the user attribute library can also recognize and respond to the data access request, thereby improving the transmission security of the user characteristic data.
- the user attribute library does not respond to the data access request.
- S413 Perform decryption processing on the visitor signature, determine whether the decrypted signature of the visitor is the same as the signature in the configuration file, and determine whether the data request parameter satisfies a preset condition.
- the preset condition is the grammatical structure rule, including the character type attribute and the range of the value range. Only when each data request parameter in the data access request satisfies a preset grammatical structure rule, the user attribute database determines that the data access request is legal and responds to the data access request, that is, according to the characteristic data of the required user. After querying the feature data corresponding to the user from the user attribute library, return to the business system that issued the data access request.
- the user attribute database If the decrypted visitor signature is different from the signature in the configuration file, and any data request parameter in the data access request does not satisfy the preset syntax structure rule, the user attribute database generates an error prompt information, and based on the error prompt information. Respond to the data access request.
- the IP address and the access key carried in the data access request exist in the configuration file of the user attribute database and the signature of the visitor is legal
- the content of the data access request message is "select" Name from persons”
- the parameter type after "from” should be the table name, and the user attribute library does not have the "persons" table, therefore, the user attribute library determines that the data request parameter is not satisfied.
- the default syntax structure rules, and the generated error message packet is returned through the interface to the business system that issues the data access request.
- the user attribute library when receiving the data access request, the user attribute library first determines whether the number of data access requests received in the first time period reaches a preset threshold.
- the first time period is a time interval from when the data access request is received to when the data access request is received, and the length of the first time period is a preset duration. If the number of data access requests received in the first time period has reached a preset threshold, the data access request received at the current time does not respond, and the above steps S411 to S414 are not performed.
- the user attribute library determines the number of data access requests within the preset duration, and can not make any response when the number exceeds the threshold, thereby preventing the user attribute library from continuously processing a large number of data access requests, and controlling The processing frequency of the data access request ensures the normal operation of the user attribute library.
- the view is performed in a view manner.
- the service system directly invokes the view provided by the user attribute library through the foregoing interface, and can directly request the user feature data corresponding to the view. Therefore, when the storage data is adjusted within the user attribute library, since the view and the interface are not changed, the service system that invokes the user attribute data of the user attribute library through the interface and the view will not be affected, and the adjustment process is performed on the service. The system is invisible.
- the interface can still pass through the interface.
- the user view data obtained by the embodiment of the present application ensures the normal operation of the service system and improves the reliability of the service system.
- FIG. 4 shows another specific implementation flow of the foregoing step S4, which is described in detail as follows:
- S421 Acquire an IP address of the data access request.
- S422 Acquire a total number of historical data access requests based on the IP address within a preset duration.
- the user attribute database determines that the source IP of the data access request received in the preset time period is the same as the IP address in S421.
- the preset time period is a time interval from a moment before the data access request is received to when the data access request is received, and the length of the preset time period is a preset duration.
- the detailed message of the above log file includes information such as the source time of each data access request, the request parameter, the source IP, the processing duration of the user attribute database for each data access request, and the processing result.
- the data access request is cached. If the duration of the data access request cached in the pre-created cache area has reached the preset duration, the user attribute database performs legality verification on the data access request.
- the data access request is not responded.
- the resource occupancy of the malicious data request to the user attribute database can be alleviated in the case of a malicious request, and the user attribute is guaranteed.
- the normal operation of the library by performing log records on the received data access requests, it is possible to provide maintenance personnel with troubleshooting clues and improve maintenance efficiency after a problem occurs in the user attribute library after the maintenance process.
- the user attribute library also provides a monitoring interface for the external monitoring server for monitoring the interface service of the user attribute library.
- the monitoring server continuously detects whether the monitoring interface returns a normal response packet for 24 hours. When the normal response packet is not received, it indicates that the interface service is abnormal. At this time, the monitoring server will issue an alarm prompt, so that the operation and maintenance personnel can quickly perform troubleshooting and restore the interface service as soon as possible, so that the user interface can be improved based on the monitoring interface. Reliability to avoid disruption of business systems.
- the application database is interconnected with the life insurance system database, the property insurance system database, and the financial fund database through a network.
- the application database is used to synchronize the basic information of each user in the life insurance system database, the property insurance system database and the wealth management fund database to the local, so that the application database is constructed as the above user attribute library.
- the application database utilizes the SPARK calculation engine to perform comprehensive analysis and processing on the basic information of each user that has been synchronized locally, and for any user, based on the life insurance insurance status of the user, the property insurance insurance status, and the purchase of the financial fund, etc.,
- the database will fully evaluate the user's consumption behavior characteristics data.
- the short MMS push system inside the enterprise requests and acquires the consumption behavior characteristic data of the user A in the application database via the user unified view query interface provided by the application database, and the short multimedia message pushing system returns the characteristics according to the application database.
- Data knowing that user A is accustomed to purchasing stock funds and hybrid funds, so the short-letter push system is selected to send push information about stock funds and hybrid funds, and the push information is sent to user A's mobile phone through SMS. . Therefore, based on the interface provided by the application database and the user characteristic data summarized and analyzed based on the application database, the business system such as the short multimedia messaging system realizes personalized customization of the user and service recommendation.
- FIG. 6 is a structural block diagram of the device for acquiring user feature data provided by the embodiment of the present application. For the convenience of description, only the embodiment of the present application is shown. Related parts.
- a device for acquiring user feature data includes:
- the synchronization module 61 is configured to synchronize user basic data in each service database to a distributed file system to construct a user attribute library.
- the summary module 62 is configured to analyze and summarize the synchronized user basic data by using a calculation engine to obtain feature data corresponding to each user.
- the obtaining module 63 is configured to obtain a data access request sent by the service system based on the interface of the user attribute library.
- the returning module 64 is configured to verify the validity of the data access request, and if the data access request is legal, return the feature data that matches the data access request to the service system.
- the returning module 74 includes:
- the determining submodule is configured to determine whether the IP address carried by the data access request and the access key are both present in a configuration file of the user attribute database.
- a first obtaining submodule configured to acquire attribute information in the data access request, if the IP address and the access key carried by the data access request are both in a configuration file of the user attribute database,
- the attribute information includes the visitor signature and the data request parameters.
- a first returning submodule configured to: if the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, the data access request is legal, and the The feature data matched by the data access request is returned to the business system.
- the synchronization module 61 includes:
- the first synchronization sub-module is configured to synchronize the user-based data in each service database to the distributed file system at a preset time interval based on the JOB mode to construct a user attribute library.
- the device for acquiring user feature data further includes:
- a storage module configured to save feature data corresponding to each user to a temporary table of the user attribute library, and delete the user basic data in each service database to the user attribute database next time The feature data already stored in the temporary table.
- the returning module 64 includes:
- the second obtaining submodule is configured to obtain an IP address of the data access request.
- the third obtaining submodule is configured to obtain a total number of historical data access requests based on the IP address within a preset duration.
- a cache submodule configured to cache the data access request if the total number of historical data access requests based on the IP address is greater than a preset threshold within a preset duration, and to cache the The validity of the data access request is verified.
- a second returning submodule configured to return the feature data that matches the data access request to the service system if the data access request is legal.
- the synchronization module 61 includes:
- the second synchronization sub-module is configured to synchronize user basic data in a preset time period to a distributed file system in each service database to construct a user attribute library.
- the preset time period is the first day of the current month to the day before the current time.
- FIG. 7 is a schematic diagram of a server according to an embodiment of the present application.
- the server 7 of this embodiment includes a processor 70 and a memory 71 in which computer readable instructions 72, such as user feature data, are executable to run on the processor 70. program.
- the processor 70 executes the computer readable instructions 72, the steps in the foregoing method for acquiring the respective user feature data are implemented, for example, steps 101 to 104 shown in FIG.
- the processor 70 when executing the computer readable instructions 72, implements the functions of the various modules/units in the various apparatus embodiments described above, such as the functions of the modules 61-64 shown in FIG.
- the computer readable instructions 72 may be partitioned into one or more modules/units that are stored in the memory 71 and executed by the processor 70, To complete this application.
- the one or more modules/units may be a series of computer readable instruction segments capable of performing a particular function for describing the execution of the computer readable instructions 72 in the server 7.
- the server 7 can be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server.
- the server may include, but is not limited to, processor 70 and memory 71. It will be understood by those skilled in the art that FIG. 7 is only an example of the server 7, and does not constitute a limitation on the server 7, and may include more or less components than those illustrated, or combine some components, or different components, such as
- the server may also include an input and output device, a network access device, a bus, and the like.
- the so-called processor 70 can be a central processing unit (Central Processing Unit, CPU), can also be other general-purpose processors, digital signal processors (DSP), application specific integrated circuits (Application Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc.
- the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
- the memory 71 may be an internal storage unit of the server 7, such as a hard disk or a memory of the server 7.
- the memory 71 may also be an external storage device of the server 7, such as a plug-in hard disk equipped with the server 7, a smart memory card (SMC), and a Secure Digital (SD) card. Flash card, etc.
- the memory 71 may also include both an internal storage unit of the server 7 and an external storage device.
- the memory 71 is configured to store the computer readable instructions and other programs and data required by the server.
- the memory 71 can also be used to temporarily store data that has been output or is about to be output.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
- the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
- the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
- a computer readable storage medium A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application.
- the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A user feature data acquisition method and device, a server, and a medium are provided, applicable to the technical field of the Internet. The method comprises: synchronizing user basic data in each service database to a distributed file system to build a user attributes library; using a computing engine to analyze and summarize the synchronized user basic data to obtain feature data corresponding to each user; based on an interface of the user attribute library, obtaining a data access request sent by a service system; and verifying validity of the data access request, and if the data access request is valid, returning, to the service system, user feature data matching the data access request. When multiple service databases are separated, the solution can rapidly obtain feature data corresponding to each user. The user feature data is evaluated comprehensively, and operations of the service system are not affected. Accurate data support is provided for functions such as personalized service customization and service recommendation.
Description
本申请要求于2017年04月26日提交中国专利局、申请号为1 201710282779.4
、发明名称为“用户特征数据的获取方法及服务器”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application is submitted to the China Patent Office on April 26, 2017, and the application number is 1 201710282779.4.
The title of the invention is the priority of the Chinese Patent Application for "Using the User Feature Data Acquisition Method and Server", the entire contents of which are hereby incorporated by reference.
本申请属于互联网技术领域,尤其涉及一种用户特征数据的获取方法、装置、服务器及介质。The present application belongs to the field of Internet technologies, and in particular, to a method, an apparatus, a server, and a medium for acquiring user feature data.
随着企业信息化的发展,大部分企业都拥有了自己的业务系统。例如,金融保险公司拥有寿险投保系统、财险投保系统以及客户个人信息系统等各类型的业务系统。每个业务系统都有自己的独立数据库,分别存储着企业在该业务方面的运行数据。With the development of enterprise information, most companies have their own business systems. For example, financial and insurance companies have various types of business systems, such as life insurance insurance systems, property insurance insurance systems, and customer personal information systems. Each business system has its own independent database, which stores the operational data of the enterprise in this business.
然而,如果仅从一个独立数据库上查询业务系统的报表数据,则难以对用户的特征信息进行全面地评估,无法为用户个性化的业务定制以及业务推荐等功能提供数据支持。在跨多个业务数据库且业务数据量过大的情况之下,业务系统无法快速地解析出多维的用户特征分析结果,从而容易影响到业务系统的稳定运行。However, if the report data of the business system is only queried from a separate database, it is difficult to comprehensively evaluate the user's feature information, and it is unable to provide data support for the user's personalized service customization and service recommendation functions. In the case of a large number of business databases and a large amount of business data, the business system cannot quickly analyze multi-dimensional user feature analysis results, which easily affects the stable operation of the business system.
综上,现有技术中,在多业务数据库分布的情况下,存在难以快速获取多维的用户特征数据的问题。In summary, in the prior art, in the case of multi-service database distribution, there is a problem that it is difficult to quickly acquire multi-dimensional user feature data.
有鉴于此,本申请实施例提供了一种用户特征数据的获取方法、装置、服务器及介质,以解决现有技术中,在多业务数据库分布的情况下,难以快速获取多维的用户特征数据的问题。In view of this, the embodiment of the present application provides a method, an apparatus, a server, and a medium for acquiring user feature data, so as to solve the problem that in the prior art, when multi-service database is distributed, it is difficult to quickly acquire multi-dimensional user feature data. problem.
本申请实施例的第一方面提供了一种用户特征数据的获取方法,包括:A first aspect of the embodiment of the present application provides a method for acquiring user feature data, including:
将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronize user-based data in each business database to a distributed file system to build a user attribute library;
利用计算引擎对已同步的用户基础数据进行分析汇总,得到与各个用户分别对应的特征数据;The calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained;
基于所述用户属性库的接口,获取业务系统发出的数据访问请求;Obtaining a data access request sent by the service system based on the interface of the user attribute library;
对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统。Verifying the validity of the data access request, and if the data access request is legal, returning the feature data of the user that matches the data access request to the service system.
本申请实施例的第二方面提供了一种用户特征数据的获取装置,该用户特征数据的获取装置包括用于执行上述第一方面所述的用户特征数据的获取方法的模块。A second aspect of the embodiments of the present application provides a device for acquiring user feature data, where the device for acquiring user feature data includes a module for performing a method for acquiring user feature data according to the above first aspect.
本申请实施例的第三方面提供了一种服务器,包括存储器以及处理器,所述存储器中存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如第一方面所述的用户特征数据的获取方法的步骤。A third aspect of an embodiment of the present application provides a server including a memory and a processor, the memory storing computer readable instructions executable on the processor, the processor executing the computer readable The steps of the method for acquiring user feature data as described in the first aspect are implemented when instructed.
本申请实施例的第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现如第一方面所述的用户特征数据的获取方法的步骤。A fourth aspect of the embodiments of the present application provides a computer readable storage medium storing computer readable instructions, the computer readable instructions being executed by a processor to implement the first aspect as described in the first aspect The steps of the method of obtaining user characteristic data.
本申请实施例中,在多业务数据库分离的情况下,通过构建汇总有各用户基础数据的用户属性库,能够基于计算引擎快速地解析出各个用户分别对应的特征数据,实现了对用户特征数据的全面评估。并且,由于分析汇总过程是在用户属性库进行的,因此不会影响到现有业务系统的运行。另外,通过利用接口接收数据访问请求,使得各个业务系统能够方便地获取用户的特征数据,进而能够根据不同用户所对应的不同特征数据,为用户个性化的业务定制以及业务推荐等功能提供准确的数据支持。同时,由于避免了业务系统需要自行对其他业务数据库中的用户基础数据执行换算处理,因此,提高了业务系统对于用户特征数据的获取效率。In the embodiment of the present application, in the case that the multi-service database is separated, by constructing a user attribute library that summarizes the basic data of each user, the feature data corresponding to each user can be quickly parsed based on the calculation engine, and the user feature data is realized. Comprehensive assessment. Moreover, since the analysis and aggregation process is performed in the user attribute library, it does not affect the operation of the existing business system. In addition, by receiving the data access request by using the interface, each service system can conveniently obtain the feature data of the user, and can provide accurate functions for personalized service customization and service recommendation according to different feature data corresponding to different users. data support. At the same time, since the business system needs to perform conversion processing on the user basic data in other service databases by itself, the efficiency of the user system for obtaining the user characteristic data is improved.
图1是本申请实施例提供的用户特征数据的获取方法的实现流程图;1 is a flowchart of an implementation of a method for acquiring user feature data provided by an embodiment of the present application;
图2是本申请另一实施例提供的用户特征数据的获取方法的实现流程图;2 is a flowchart of an implementation of a method for acquiring user feature data according to another embodiment of the present application;
图3是本申请实施例提供的用户特征数据的获取方法S4的一具体实现流程图;FIG. 3 is a specific implementation flowchart of a method S4 for acquiring user feature data according to an embodiment of the present application;
图4是本申请实施例提供的用户特征数据的获取方法S4的另一具体实现流程图;FIG. 4 is a flowchart of another specific implementation of the method S4 for acquiring user feature data provided by the embodiment of the present application;
图5是本申请实施例提供的用户特征数据的获取方法所适用的应用场景示意图;FIG. 5 is a schematic diagram of an application scenario applicable to a method for acquiring user feature data according to an embodiment of the present disclosure;
图6是本申请实施例提供的用户特征数据的获取装置的结构框图;FIG. 6 is a structural block diagram of an apparatus for acquiring user feature data according to an embodiment of the present application;
图7是本申请实施例提供的服务器的示意图。FIG. 7 is a schematic diagram of a server provided by an embodiment of the present application.
为了说明本申请所述的技术方案,下面通过具体实施例来进行说明。In order to explain the technical solutions described in the present application, the following description will be made by way of specific embodiments.
本申请实施例所述的用户特征数据的获取方法可在安装有分布式文件系统的服务器上执行。其中,分布式文件系统可以是Hadoop分布式文件系统(HDFS文件系统),其能够适合运行在通用硬件(commodity
hardware)上,能提供高吞吐量的数据访问。图1示出了本申请实施例提供的用户特征数据的获取方法的实现流程,详述如下:The method for acquiring user feature data according to the embodiment of the present application may be executed on a server on which a distributed file system is installed. Among them, the distributed file system can be a Hadoop distributed file system (HDFS file system), which can be adapted to run on common hardware (commodity)
Hardware) provides high throughput data access. FIG. 1 is a flowchart showing an implementation process of a method for acquiring user feature data provided by an embodiment of the present application, which is described in detail as follows:
S1:将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库。S1: Synchronize user-based data in each business database to a distributed file system to build a user attribute library.
本申请实施例中,业务数据库用于管理和存储业务系统所产生的数据资源,数据资源包括用户基础数据。其中,用户是指业务所直接面对的客户,即用户基础数据为与客户相关的业务资料或基本信息。例如,寿险投保系统所产生的用户基础数据为保单客户的个人基础数据。不同业务系统所产生的用户基础数据可存储于不同的业务数据库中。In the embodiment of the present application, the service database is used to manage and store data resources generated by the service system, and the data resources include user basic data. Among them, the user refers to the customer directly facing the business, that is, the user basic data is the business data or basic information related to the customer. For example, the user-based data generated by the life insurance system is the personal basic data of the policy customer. User-based data generated by different business systems can be stored in different business databases.
利用hadoop技术将各个业务数据库所存储的用户基础数据分别同步至HDFS文件系统后,由于用户基础数据包括用户在办理不同业务时的个人资料以及消费信息,因而在实质上,拥有各个业务系统所产生的用户基础数据的HDFS文件系统构成了一个用户属性库,其也是一个数据库。After the user-based data stored in each service database is synchronized to the HDFS file system by using the Hadoop technology, since the user-based data includes the personal data and the consumption information of the user when handling different services, the user-specific data system is generated substantially. The HDFS file system of the user-based data constitutes a user attribute library, which is also a database.
在上述用户基础数据的同步过程中,虽然存在有传输时间差,无法保持业务数据库与用户属性库之间的实时同步,但由于用户属性库对资源数据的实时性要求不高,因此上述用户基础数据的同步方式可以满足用户属性库的建设需求。In the synchronization process of the above-mentioned user basic data, although there is a transmission time difference, the real-time synchronization between the service database and the user attribute database cannot be maintained, but since the user attribute library has low requirements on the real-time performance of the resource data, the above-mentioned user basic data is not required. The synchronization method can meet the construction requirements of the user attribute library.
S2:利用计算引擎对已同步的用户基础数据进行分析汇总,得到与各个用户分别对应的特征数据。S2: The calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained.
本申请实施例中,SPARK是指Apache Spark,是专为大规模数据处理而设计的快速通用的计算引擎。SPARK计算引擎启用了内存分布数据集,除了能够提供交互式查询外,还可以优化迭代工作负载。In the embodiment of the present application, SPARK refers to Apache Spark, which is a fast and universal computing engine designed for large-scale data processing. The SPARK Compute Engine enables memory-distributed datasets to optimize iterative workloads in addition to providing interactive queries.
用户属性库采用SPARK计算引擎以及ETL(Extraction,Transformation,Loading)工具对收集完的大量用户基础数据执行转换和过滤分析处理后,最终按照预先定义好的数据仓库模型,将处理结果加载到数据仓库中,则该处理结果即为汇总后的用户特征数据。用户属性库所存储的用户特征数据是针对用户在各业务系统的使用情况、使用习惯以及关注业务的汇总信息,是针对每个用户进行的汇总统计。The user attribute library uses the SPARK calculation engine and the ETL (Extraction, Transformation, Loading) tool to perform conversion and filtering analysis on the collected large amount of user basic data, and finally loads the processing result into the data warehouse according to the predefined data warehouse model. In the middle, the processing result is the summarized user characteristic data. The user characteristic data stored in the user attribute database is summary information for the usage, usage habits, and attention service of the user in each business system, and is summary statistics for each user.
由于用户属性库是Hadoop分布式文件系统,因此,在实际的系统结构上,可以由多台服务器组成一个Hadoop分布式文件系统。用户基础数据同步至HDFS文件系统时,还可以将来源于不同业务数据库的用户基础数据分别同步至不同的服务器,由每台服务器分别采用SPARK计算引擎以及ETL工具对该台服务器上已同步的用户基础数据进行分析汇总,此后再将各台服务器的分析结果执行汇总。Since the user attribute library is a Hadoop distributed file system, in the actual system structure, a Hadoop distributed file system can be composed of multiple servers. When the user base data is synchronized to the HDFS file system, the user base data from different service databases can also be synchronized to different servers, and each server uses the SPARK calculation engine and the ETL tool to synchronize the users on the server. The basic data is analyzed and summarized, and then the analysis results of each server are summarized.
本申请实施例中,由于用户属性库使用了多台服务器并行的方式实现用户基础数据的分析汇总,因此,提高了系统的数据处理效率。In the embodiment of the present application, since the user attribute library uses multiple servers in parallel to implement analysis and summary of user basic data, the data processing efficiency of the system is improved.
S3:基于所述用户属性库的接口,获取业务系统发出的数据访问请求。S3: Obtain a data access request sent by the service system based on the interface of the user attribute library.
接口是用户属性库与外部其他部件进行信息交换的共享边界,采用java的servlet进行开发,以预设的表达程序结构的形式以及预设的SQL语言提供的数据组成。用户的特征数据均由用户属性库的接口对外进行发布。企业内部的其他业务系统如果需要获取或调用某个用户的特征数据,则采用post的方式自行调用用户属性库提供的接口,进而发出数据访问请求。其中,用户属性库通过http协议与业务系统实行交互。The interface is a shared boundary between the user attribute library and other external components for information exchange. It is developed by using the java servlet, and is composed of a preset expression program structure and data provided by a preset SQL language. The user's feature data is published externally by the interface of the user attribute library. If other business systems within the enterprise need to obtain or invoke the feature data of a certain user, the interface provided by the user attribute library is invoked by the post method to issue a data access request. The user attribute library interacts with the business system through the http protocol.
作为本申请的另一个实施例,如图2所示,上述步骤S1具体包括:As another embodiment of the present application, as shown in FIG. 2, the foregoing step S1 specifically includes:
S11:基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至Hadoop分布式文件系统,以构建用户属性库。S11: Based on the JOB mode, the user basic data in each service database is synchronized to the Hadoop distributed file system at a preset time interval to construct a user attribute library.
在上述S2之后,S3之前,上述用户特征数据的获取方法还包括:After the foregoing S2, before S3, the method for acquiring the user feature data further includes:
S21:将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。S21: Save the feature data corresponding to each user to a temporary table of the user attribute library, and delete the temporary table in the next time before the user basic data in each service database is synchronized to the user attribute database. The feature data that has been stored.
本申请实施例中,JOB为数据库所提供的一个定期执行某个存储过程或者包体的功能。为了使HDFS文件系统能够触发同步操作,在数据库层面通过SQL JOB定时作业的方式实现数据同步,其实现原理包括:通过JOB方式创建数据同步任务,并在该数据同步任务所对应的执行条件中,设置任务执行时刻以及任务执行间隔。当当前的时刻达到该预设的任务执行时刻时,或者,当当前的时刻与该任务执行时刻的差值为任务执行间隔的倍数时,令HDFS文件系统创建数据库连接,用于分别与各个业务数据库相连。此后,执行预设于HDFS文件系统的SQL语句,根据SQL语句的语法解析结果,若业务数据库中存在有SQL语句所对应的表参数时,则令HDFS文件系统以预设的身份权限从业务数据库中读取用户基础数据,并将读取到的用户基础数据更新插入至HDFS文件系统。In the embodiment of the present application, the JOB is a function provided by the database to periodically execute a certain stored procedure or a package body. In order to enable the HDFS file system to trigger the synchronization operation, the data synchronization is implemented by the SQL JOB timing job at the database level. The implementation principle includes: creating a data synchronization task by using the JOB method, and in the execution condition corresponding to the data synchronization task, Set the task execution time and the task execution interval. When the current time reaches the preset task execution time, or when the difference between the current time and the task execution time is a multiple of the task execution interval, the HDFS file system is created to establish a database connection for each service. The database is connected. Thereafter, the SQL statement preset to the HDFS file system is executed, and according to the syntax analysis result of the SQL statement, if the table parameter corresponding to the SQL statement exists in the service database, the HDFS file system is preset from the service database by the identity authority. The user base data is read and the read user base data update is inserted into the HDFS file system.
特别地,HDFS文件系统创建数据库连接包括创建远程数据库的dblink。基于dblink的方式,HDFS文件系统能够像访问本地数据库一样快速获取远程业务数据库的用户基础数据,提高同步效率。In particular, the HDFS file system creates a database connection that includes creating a dblink for the remote database. Based on the dblink method, the HDFS file system can quickly acquire the user base data of the remote service database as fast as accessing the local database, thereby improving synchronization efficiency.
若JOB方式的数据同步任务中所预设的需要同步的数据为业务系统在预设时间段内所产生的用户基础数据,则执行数据同步任务时,HDFS文件系统只会将业务系统在该预设时间段内所产生的用户基础数据同步至本地。优选地,该预设时间段为当月第一天至当天之前。即,HDFS文件系统只同步每月的1号至当天之前由业务系统所产生的用户基础数据。If the data to be synchronized preset in the data synchronization task of the JOB mode is the user basic data generated by the service system within a preset time period, when the data synchronization task is executed, the HDFS file system only puts the service system in the pre-predetermined time. Set the user base data generated during the time period to be synchronized to the local. Preferably, the preset time period is from the first day of the month to the day before. That is, the HDFS file system only synchronizes the user-based data generated by the business system from the 1st of the month to the day.
在数据同步任务执行的当天,由于当天还没过去,业务数据库中的用户基础数据依然处于实时更新状态,因此,为了避免HDFS文件系统在同步后,因业务数据库中的用户基础数据又出现了更新而导致HDFS文件系统与业务数据库在实质上并不同步的现象发生,HDFS文件系统每次执行数据同步任务时,只同步当天之前由业务系统所产生的用户基础数据。另外,在实际操作中,因用户的业务生效时长都是以月为单位来计算的,并且都是以每个月的31日为失效日,例如,用户在10月2日投保了为期5个月的人寿保单,则第二年的3月31日为该份保单的失效日。故每月1号之前业务数据库中的大部分用户基础数据都是历史数据,为了提高HDFS文件系统对于用户基础数据的同步有效性、降低同步数据量以及提高同步效率,HDFS文件系统仅同步每月的1号至当天之前由业务系统所产生的用户基础数据。On the day of the data synchronization task execution, the user base data in the service database is still in the real-time update state because the day has not passed. Therefore, in order to prevent the HDFS file system from being synchronized, the user base data in the service database is updated again. As a result, the HDFS file system and the service database are not substantially synchronized. When the HDFS file system performs the data synchronization task, only the user basic data generated by the business system before the day is synchronized. In addition, in actual operation, since the effective time of the user's business is calculated on a monthly basis, and the 31st of each month is the expiration date, for example, the user is insured for 5 on October 2 The monthly life insurance policy, the March 31 of the second year is the expiration date of the policy. Therefore, most of the user-based data in the service database before the 1st of each month is historical data. In order to improve the synchronization effectiveness of the HDFS file system for user-based data, reduce the amount of synchronization data, and improve synchronization efficiency, the HDFS file system only synchronizes monthly. User-based data generated by the business system from the 1st to the day.
特别地,在用户属性库中,根据用户级别的不同,用户基础数据分为普通级别和重要级别。用户属性库预设有两个JOB任务,分别用于设置关于普通级别和重要级别的用户基础数据的分析汇总动作执行时刻。In particular, in the user attribute library, the user basic data is divided into a common level and an important level according to the user level. The user property library is pre-configured with two JOB tasks for setting the analysis summary action execution time of the user base data of the common level and the important level.
例如,用户属性库根据JOB任务所预设的规则,对已同步的重要级别的用户基础数据每天执行一次分析汇总操作,而对普通级别的用户基础数据则每周才执行一次分析汇总操作。For example, the user attribute library performs an analysis and summary operation on the user-level data of the synchronized important level according to the rules preset by the JOB task, and performs the analysis summary operation on the weekly basis for the user-level data of the common level.
优选地,用户属性库仅在非访问繁忙时段才执行对用户基础数据的分析汇总操作。例如,根据统计结果,业务系统通常会集中在09:00至18:00的时间段访问用户属性库中的用户特征数据,则JOB任务中对已同步于本地的用户基础数据的分析汇总动作的任务执行时刻可以设置为凌晨0:00。Preferably, the user attribute library performs an analysis summary operation on the user base data only during the non-access busy period. For example, according to the statistical result, the business system usually concentrates on accessing the user characteristic data in the user attribute database during the time period from 09:00 to 18:00, and the JOB task analyzes and summarizes the user basic data that has been synchronized to the local. The task execution time can be set to 0:00 in the morning.
本申请实施例中,通过对用户属性库的分析汇总动作执行时刻进行限定,保证了在业务系统发出数据访问请求时,用户属性库能够及时作出响应,不会因分析汇总操作而占用过多的系统资源,提高了用户属性库的响应效率。通过将特征数据储存于临时表,能够减少用户属性库内存资源的消耗,通过定时对临时表中的特征数据进行清除操作,保证临时表中的数据量不会过多,使用户属性库有更好的性能。In the embodiment of the present application, the execution time of the analysis and summary action of the user attribute database is limited, so that when the service system issues a data access request, the user attribute database can respond in time, and does not occupy too much due to the analysis and summary operation. System resources improve the response efficiency of the user property library. By storing the feature data in the temporary table, the consumption of the memory resources of the user attribute library can be reduced, and the feature data in the temporary table can be cleared by timing to ensure that the amount of data in the temporary table is not excessive, so that the user attribute library has more Good performance.
S4:对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统。S4: Verify the validity of the data access request, and if the data access request is legal, return the feature data of the user that matches the data access request to the service system.
用户属性库从接口接收到任一业务系统发出的基于用户特征数据的数据访问请求时,并不是马上将用户的特征数据返回至业务系统,而是先判断该数据访问请求是否合法,包括:判断数据访问请求的来源者是否满足预设的数据获取权限以及判断数据访问请求的数据内容是否参数异常。只有当数据访问请求合法时,用户属性库才查询并返回业务系统所请求获取的用户特征数据。业务系统请求获取的用户特征数据为与数据访问请求中所携带的用户号码相匹配的特征数据。用户属性库根据该用户号码从已存储的数据中执行查询,并将查询得到的特征数据转换为xml格式的数据后返回。When the user attribute database receives the data access request based on the user feature data sent by any service system from the interface, it does not immediately return the user's feature data to the service system, but first determines whether the data access request is legal, including: Whether the source of the data access request satisfies the preset data acquisition authority and determines whether the data content of the data access request is abnormal. Only when the data access request is legal, the user property library queries and returns the user feature data requested by the business system. The user feature data requested by the service system is the feature data that matches the user number carried in the data access request. The user attribute library performs a query from the stored data according to the user number, and converts the feature data obtained by the query into data in an xml format and returns.
作为本申请的一个实施例,图3示出了上述步骤S4的具体实现流程,详述如下:As an embodiment of the present application, FIG. 3 shows a specific implementation process of the foregoing step S4, which is described in detail as follows:
S411:判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中。S411: Determine whether the IP address carried by the data access request and the access key are both present in a configuration file of the user attribute database.
用户属性库对接收到的数据访问请求进行解析处理,从数据访问请求的数据报中读取源IP字段所对应的第一属性值以及访问密钥字段所对应的第二属性值,则该第一属性值即为上述数据访问请求所携带的IP地址,表示了发出数据访问请求的业务系统当前所使用的IP地址,第二属性值即为访问密钥,表示业务系统的操作者所输入的账号密钥。The user attribute library parses the received data access request, and reads the first attribute value corresponding to the source IP field and the second attribute value corresponding to the access key field from the data report of the data access request, then the first An attribute value is an IP address carried by the data access request, and represents an IP address currently used by the service system that issues the data access request, and the second attribute value is an access key, which is input by the operator of the service system. Account key.
在运行过程中,用户属性库会加载自身所需环境的设置和文件的集合,该集合即为配置文件。为了对数据访问请求的合法性进行验证,将允许获取用户特征数据的源IP地址和访问密钥预先保存于配置文件中。配置文件中所保存的合法IP地址包括精确IP地址、IP地址段或者以通配符形式表示的IP地址。只要用户属性库检测到数据访问请求所携带的IP地址与该配置文件中的IP地址相同或者属于该配置文件中的任一IP地址段之内,则可确定数据访问请求所携带的IP地址存在于用户属性库的配置文件中。During the run, the user property library loads the settings of the environment and the collection of files that it needs, which is the configuration file. In order to verify the validity of the data access request, the source IP address and the access key that are allowed to acquire the user feature data are pre-stored in the configuration file. The legal IP address saved in the configuration file includes the exact IP address, IP address segment, or IP address in the form of a wildcard. As long as the user attribute database detects that the IP address carried by the data access request is the same as the IP address in the configuration file or belongs to any IP address segment in the configuration file, it may be determined that the IP address carried by the data access request exists. In the configuration file of the user property library.
S412:若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括数据请求参数以及已加密的访问者签名。S412: If the IP address and the access key carried in the data access request are both in the configuration file of the user attribute database, acquire attribute information in the data access request, where the attribute information includes a data request parameter. And an encrypted visitor signature.
本申请实施例中,访问者签名是附加在数据访问请求上的密钥数据,或是对密钥数据所作的一种变换。用户属性库利用预先约定的解密密钥对接收到的访问者签名进行解密,从而判定数据访问请求的来源是否为真实的授权用户,并确认数据访问请求的报文完整性,从而在接收到不法分子所伪造处理的数据访问请求时,用户属性库也能识别出来并对该数据访问请求不作响应,由此提高用户特征数据的传输安全。In the embodiment of the present application, the visitor signature is key data attached to the data access request or a transformation of the key data. The user attribute library decrypts the received visitor signature by using a pre-agreed decryption key, thereby determining whether the source of the data access request is a real authorized user, and confirming the message integrity of the data access request, thereby receiving the illegality When the numerator falsifies the data access request, the user attribute library can also recognize and respond to the data access request, thereby improving the transmission security of the user characteristic data.
特别地,若数据访问请求所携带的IP地址或访问密钥不存在于用户属性库的配置文件中,则用户属性库对该数据访问请求不作响应。In particular, if the IP address or access key carried by the data access request does not exist in the configuration file of the user attribute library, the user attribute library does not respond to the data access request.
S413:对所述访问者签名进行解密处理,判断解密后的所述访问者签名与所述配置文件中的签名是否相同,并判断所述数据请求参数是否满足预设条件。S413: Perform decryption processing on the visitor signature, determine whether the decrypted signature of the visitor is the same as the signature in the configuration file, and determine whether the data request parameter satisfies a preset condition.
S414:若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则确定所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。S414: If the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter meets a preset condition, determining that the data access request is legal and matching the data access request The feature data is returned to the business system.
预设条件即语法结构规则,包括字符类型属性以及取值范围区间等规则。只有数据访问请求中的各数据请求参数均满足预设的语法结构规则时,用户属性库才确定该数据访问请求合法,并对该数据访问请求作出响应,即,根据其所需用户的特征数据,从用户属性库中查询对应该用户的特征数据后,返回至发出数据访问请求的业务系统。The preset condition is the grammatical structure rule, including the character type attribute and the range of the value range. Only when each data request parameter in the data access request satisfies a preset grammatical structure rule, the user attribute database determines that the data access request is legal and responds to the data access request, that is, according to the characteristic data of the required user. After querying the feature data corresponding to the user from the user attribute library, return to the business system that issued the data access request.
若解密后的访问者签名与配置文件中的签名不相同,数据访问请求中的任一数据请求参数不满足预设的语法结构规则,则用户属性库生成错误提示信息,并基于该错误提示信息对该数据访问请求作出响应。If the decrypted visitor signature is different from the signature in the configuration file, and any data request parameter in the data access request does not satisfy the preset syntax structure rule, the user attribute database generates an error prompt information, and based on the error prompt information. Respond to the data access request.
例如,在数据访问请求所携带的IP地址和访问密钥均存在于用户属性库的配置文件以及访问者签名合法的情况之下,若数据访问请求的报文内容为“select
name from persons”,则根据语法结构规则,“from”后面的参数类型应当是表名,而用户属性库中,并不存在“persons”这个表,因此,用户属性库确定该数据请求参数不满足预设的语法结构规则,并将生成的错误提示数据包通过接口返回至发出数据访问请求的业务系统。For example, if the IP address and the access key carried in the data access request exist in the configuration file of the user attribute database and the signature of the visitor is legal, if the content of the data access request message is "select"
Name from persons", according to the grammatical structure rules, the parameter type after "from" should be the table name, and the user attribute library does not have the "persons" table, therefore, the user attribute library determines that the data request parameter is not satisfied. The default syntax structure rules, and the generated error message packet is returned through the interface to the business system that issues the data access request.
优选地,用户属性库在接收到数据访问请求时,先判断第一时间段内所接收到的数据访问请求的数量是否达到了预设阈值。其中,第一时间段为接收到该数据访问请求之前的某一时刻至接收到该数据访问请求时的时间区间,且第一时间段的长度为预设时长。若第一时间段内所接收到的数据访问请求的数量已达到了预设阈值,则对当前时刻所接收到的数据访问请求不作响应,也不执行上述步骤S411至步骤S414。Preferably, when receiving the data access request, the user attribute library first determines whether the number of data access requests received in the first time period reaches a preset threshold. The first time period is a time interval from when the data access request is received to when the data access request is received, and the length of the first time period is a preset duration. If the number of data access requests received in the first time period has reached a preset threshold, the data access request received at the current time does not respond, and the above steps S411 to S414 are not performed.
在本申请实施例中,用户属性库通过对预设时长内的数据访问请求的数量进行判断,能够在数量超过阈值时不作出任何响应,避免了用户属性库持续处理大量的数据访问请求,控制了关于数据访问请求的处理频率,保证了用户属性库的正常运行。In the embodiment of the present application, the user attribute library determines the number of data access requests within the preset duration, and can not make any response when the number exceeds the threshold, thereby preventing the user attribute library from continuously processing a large number of data access requests, and controlling The processing frequency of the data access request ensures the normal operation of the user attribute library.
特别地,用户属性库对外提供查询用户特征数据时,均采用视图的方式进行。业务系统通过上述接口直接调用用户属性库所提供的视图,能够直接请求获取该视图所对应的用户特征数据。因此,在用户属性库内部发生存储数据的调整时,由于视图与接口均未发生改变,因而通过接口以及视图调用用户属性库内部用户特征数据的业务系统将不会受到影响,该调整过程对业务系统来说是不可见的。In particular, when the user attribute database provides querying user feature data externally, the view is performed in a view manner. The service system directly invokes the view provided by the user attribute library through the foregoing interface, and can directly request the user feature data corresponding to the view. Therefore, when the storage data is adjusted within the user attribute library, since the view and the interface are not changed, the service system that invokes the user attribute data of the user attribute library through the interface and the view will not be affected, and the adjustment process is performed on the service. The system is invisible.
例如,在用户属性库对历史的用户特征数据进行转移或者对大批量的用户特征数据执行分表操作时,虽然用户特征数据的存储结构发生了改变,但在业务系统看来,依然能通过接口调用相同的一个视图来获取自身所需的用户特征数据,因此,本申请实施例提供的用户特征数据获取方法保证了业务系统的正常运作,提高了业务系统的可靠性。For example, when the user attribute database transfers historical user feature data or performs sub-table operations on a large number of user feature data, although the storage structure of the user feature data changes, in the view of the business system, the interface can still pass through the interface. The user view data obtained by the embodiment of the present application ensures the normal operation of the service system and improves the reliability of the service system.
作为本申请的一个实施例,图4示出了上述步骤S4的另一具体实现流程,详述如下:As an embodiment of the present application, FIG. 4 shows another specific implementation flow of the foregoing step S4, which is described in detail as follows:
S421:获取所述数据访问请求的IP地址。S421: Acquire an IP address of the data access request.
S422:获取预设时长内基于所述IP地址的历史数据访问请求的总数。S422: Acquire a total number of historical data access requests based on the IP address within a preset duration.
本申请实施例中,根据访问日志文件,用户属性库判断预设时间段内所接收到的数据访问请求的来源IP与S421中的IP地址相同的数量。其中,预设时间段为接收到该数据访问请求之前某一时刻至接收到该数据访问请求时的时间区间,且该预设时间段的长度为预设时长。In the embodiment of the present application, according to the access log file, the user attribute database determines that the source IP of the data access request received in the preset time period is the same as the IP address in S421. The preset time period is a time interval from a moment before the data access request is received to when the data access request is received, and the length of the preset time period is a preset duration.
上述日志文件的详细报文中记载有各个数据访问请求的来源时间、请求参数、来源IP、用户属性库对各个数据访问请求的处理时长以及处理结果等信息。The detailed message of the above log file includes information such as the source time of each data access request, the request parameter, the source IP, the processing duration of the user attribute database for each data access request, and the processing result.
S423:若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证。S423: If the total number of historical data access requests based on the IP address is greater than a preset threshold within a preset duration, the data access request is cached, and the cached data access request is cached after a preset time delay. Legality is verified.
若预设时长内基于所述IP地址的数据访问请求的总数大于第一预设阈值且小于第二预设阈值,则对所述数据访问请求进行缓存。若数据访问请求缓存于预先创建的缓存区的时长已经达到了预设时长,则用户属性库对该数据访问请求进行合法性验证。And if the total number of data access requests based on the IP address in the preset duration is greater than a first preset threshold and less than a second preset threshold, the data access request is cached. If the duration of the data access request cached in the pre-created cache area has reached the preset duration, the user attribute database performs legality verification on the data access request.
若预设时长内基于所述IP地址的数据访问请求的总数大于第二预设阈值,则不对该数据访问请求进行响应。If the total number of data access requests based on the IP address within the preset duration is greater than a second predetermined threshold, the data access request is not responded.
S424:若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。S424: If the data access request is legal, return the feature data that matches the data access request to the service system.
本申请实施例中未提到的步骤原理与上述各个实施例中的步骤原理一致,因此不再一一赘述。The principle of the steps not mentioned in the embodiment of the present application is consistent with the principle of the steps in the foregoing embodiments, and therefore will not be further described.
在本申请实施例中,通过对同一来源IP的数据访问请求进行延时处理或者不作出响应,能够在恶意请求出现的情况下,减轻恶意数据请求对用户属性库的资源占用,保障了用户属性库的正常工作;通过对接收到的各个数据访问请求执行日志记录,能够在后期维护过程中后,当用户属性库出现问题时,为维护人员提供排查线索,提高维护效率。In the embodiment of the present application, by delaying or not responding to the data access request of the same source IP, the resource occupancy of the malicious data request to the user attribute database can be alleviated in the case of a malicious request, and the user attribute is guaranteed. The normal operation of the library; by performing log records on the received data access requests, it is possible to provide maintenance personnel with troubleshooting clues and improve maintenance efficiency after a problem occurs in the user attribute library after the maintenance process.
进一步地,用户属性库除了提供上述接口供外部业务系统获取用户的特征数据之外,还为外部的监控服务器提供了监控接口,用于对用户属性库的接口服务进行监控。监控服务器24小时持续检测监控接口是否返回正常响应数据包。当未接收到正常响应数据包时,表示接口服务异常,此时,监控服务器将发出告警提示,以便运维人员迅速执行故障排查工作,尽快恢复接口服务,从而基于该监控接口能够提高用户属性库的可靠性,避免业务系统受到影响。Further, in addition to providing the foregoing interface for the external service system to acquire the feature data of the user, the user attribute library also provides a monitoring interface for the external monitoring server for monitoring the interface service of the user attribute library. The monitoring server continuously detects whether the monitoring interface returns a normal response packet for 24 hours. When the normal response packet is not received, it indicates that the interface service is abnormal. At this time, the monitoring server will issue an alarm prompt, so that the operation and maintenance personnel can quickly perform troubleshooting and restore the interface service as soon as possible, so that the user interface can be improved based on the monitoring interface. Reliability to avoid disruption of business systems.
作为本申请实施例的一个具体实施示例,在图5所示的一个实际应用场景中,应用数据库分别与寿险系统数据库、财险系统数据库以及理财基金数据库通过网络互连。应用数据库用于将寿险系统数据库、财险系统数据库以及理财基金数据库中各用户的基础信息同步至本地,以令应用数据库构建为上述用户属性库。此后,应用数据库利用SPARK计算引擎对已同步于本地的每个用户的基础信息进行综合分析处理,对于任一用户,基于该用户的寿险投保状况、财险投保状况以及理财基金购买情况等,应用数据库将全面评估得到该用户的消费行为特征数据。另一方面,企业内部的短彩信推送系统经由应用数据库所提供的用户统一视图查询接口,请求并获取应用数据库中用户A的消费行为特征数据,则该短彩信推送系统根据应用数据库所返回的特征数据,得知用户A习惯购买股票型基金和混合型基金,因而筛选短彩信推送系统出关于股票型基金和混合型基金的推送信息,并通过短信方式将该推送信息发送至用户A的手机中。因此,基于应用数据库所提供的接口以及基于应用数据库所汇总分析的用户特征数据,短彩信推送系统等业务系统实现了用户个性化的业务定制以及业务推荐等功能。As a specific implementation example of the embodiment of the present application, in an actual application scenario shown in FIG. 5, the application database is interconnected with the life insurance system database, the property insurance system database, and the financial fund database through a network. The application database is used to synchronize the basic information of each user in the life insurance system database, the property insurance system database and the wealth management fund database to the local, so that the application database is constructed as the above user attribute library. Thereafter, the application database utilizes the SPARK calculation engine to perform comprehensive analysis and processing on the basic information of each user that has been synchronized locally, and for any user, based on the life insurance insurance status of the user, the property insurance insurance status, and the purchase of the financial fund, etc., The database will fully evaluate the user's consumption behavior characteristics data. On the other hand, the short MMS push system inside the enterprise requests and acquires the consumption behavior characteristic data of the user A in the application database via the user unified view query interface provided by the application database, and the short multimedia message pushing system returns the characteristics according to the application database. Data, knowing that user A is accustomed to purchasing stock funds and hybrid funds, so the short-letter push system is selected to send push information about stock funds and hybrid funds, and the push information is sent to user A's mobile phone through SMS. . Therefore, based on the interface provided by the application database and the user characteristic data summarized and analyzed based on the application database, the business system such as the short multimedia messaging system realizes personalized customization of the user and service recommendation.
对应于上文实施例所述的用户特征数据的获取方法,图6示出了本申请实施例提供的用户特征数据的获取装置的结构框图,为了便于说明,仅示出了与本申请实施例相关的部分。Corresponding to the method for acquiring the user feature data described in the foregoing embodiment, FIG. 6 is a structural block diagram of the device for acquiring user feature data provided by the embodiment of the present application. For the convenience of description, only the embodiment of the present application is shown. Related parts.
如图6所示,本实施例中,一种用户特征数据的获取装置包括:As shown in FIG. 6, in this embodiment, a device for acquiring user feature data includes:
同步模块61,用于将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库。The synchronization module 61 is configured to synchronize user basic data in each service database to a distributed file system to construct a user attribute library.
汇总模块62,用于利用计算引擎对已同步的所述用户基础数据进行分析汇总,得到各个用户分别对应的特征数据。The summary module 62 is configured to analyze and summarize the synchronized user basic data by using a calculation engine to obtain feature data corresponding to each user.
获取模块63,用于基于所述用户属性库的接口,获取业务系统发出的数据访问请求。The obtaining module 63 is configured to obtain a data access request sent by the service system based on the interface of the user attribute library.
返回模块64,用于对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。The returning module 64 is configured to verify the validity of the data access request, and if the data access request is legal, return the feature data that matches the data access request to the service system.
可选地,所述返回模块74包括:Optionally, the returning module 74 includes:
判断子模块,用于判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中。The determining submodule is configured to determine whether the IP address carried by the data access request and the access key are both present in a configuration file of the user attribute database.
第一获取子模块,用于若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括访问者签名以及数据请求参数。a first obtaining submodule, configured to acquire attribute information in the data access request, if the IP address and the access key carried by the data access request are both in a configuration file of the user attribute database, The attribute information includes the visitor signature and the data request parameters.
第一返回子模块,用于若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。a first returning submodule, configured to: if the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, the data access request is legal, and the The feature data matched by the data access request is returned to the business system.
可选地,所述同步模块61包括:Optionally, the synchronization module 61 includes:
第一同步子模块,用于基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库。The first synchronization sub-module is configured to synchronize the user-based data in each service database to the distributed file system at a preset time interval based on the JOB mode to construct a user attribute library.
所述用户特征数据的获取装置还包括:The device for acquiring user feature data further includes:
存储模块,用于将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。a storage module, configured to save feature data corresponding to each user to a temporary table of the user attribute library, and delete the user basic data in each service database to the user attribute database next time The feature data already stored in the temporary table.
可选地,所述返回模块64包括:Optionally, the returning module 64 includes:
第二获取子模块,用于获取所述数据访问请求的IP地址。The second obtaining submodule is configured to obtain an IP address of the data access request.
第三获取子模块,用于获取预设时长内基于所述IP地址的历史数据访问请求的总数。The third obtaining submodule is configured to obtain a total number of historical data access requests based on the IP address within a preset duration.
缓存子模块,用于若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证。a cache submodule, configured to cache the data access request if the total number of historical data access requests based on the IP address is greater than a preset threshold within a preset duration, and to cache the The validity of the data access request is verified.
第二返回子模块,用于若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。And a second returning submodule, configured to return the feature data that matches the data access request to the service system if the data access request is legal.
可选地,所述同步模块61包括:Optionally, the synchronization module 61 includes:
第二同步子模块,用于在各个业务数据库中,将预设时间段内的用户基础数据同步至分布式文件系统,以构建用户属性库。The second synchronization sub-module is configured to synchronize user basic data in a preset time period to a distributed file system in each service database to construct a user attribute library.
其中,所述预设时间段为当月第一天至当前时刻的前一天。The preset time period is the first day of the current month to the day before the current time.
图7是本申请一实施例提供的服务器的示意图。如图7所示,该实施例的服务器7包括:处理器70以及存储器71,所述存储器71中存储有可在所述处理器70上运行的计算机可读指令72,例如用户特征数据的获取程序。所述处理器70执行所述计算机可读指令72时实现上述各个用户特征数据的获取方法实施例中的步骤,例如图1所示的步骤101至104。或者,所述处理器70执行所述计算机可读指令72时实现上述各装置实施例中各模块/单元的功能,例如图6所示模块61至64的功能。FIG. 7 is a schematic diagram of a server according to an embodiment of the present application. As shown in FIG. 7, the server 7 of this embodiment includes a processor 70 and a memory 71 in which computer readable instructions 72, such as user feature data, are executable to run on the processor 70. program. When the processor 70 executes the computer readable instructions 72, the steps in the foregoing method for acquiring the respective user feature data are implemented, for example, steps 101 to 104 shown in FIG. Alternatively, the processor 70, when executing the computer readable instructions 72, implements the functions of the various modules/units in the various apparatus embodiments described above, such as the functions of the modules 61-64 shown in FIG.
示例性的,所述计算机可读指令72可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器71中,并由所述处理器70执行,以完成本申请。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机可读指令段,该指令段用于描述所述计算机可读指令72在所述服务器7中的执行过程。Illustratively, the computer readable instructions 72 may be partitioned into one or more modules/units that are stored in the memory 71 and executed by the processor 70, To complete this application. The one or more modules/units may be a series of computer readable instruction segments capable of performing a particular function for describing the execution of the computer readable instructions 72 in the server 7.
所述服务器7可以是桌上型计算机、笔记本、掌上电脑及云端服务器等计算设备。所述服务器可包括,但不仅限于处理器70和存储器71。本领域技术人员可以理解,图7仅仅是服务器7的示例,并不构成对服务器7的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述服务器还可以包括输入输出设备、网络接入设备、总线等。The server 7 can be a computing device such as a desktop computer, a notebook, a palmtop computer, and a cloud server. The server may include, but is not limited to, processor 70 and memory 71. It will be understood by those skilled in the art that FIG. 7 is only an example of the server 7, and does not constitute a limitation on the server 7, and may include more or less components than those illustrated, or combine some components, or different components, such as The server may also include an input and output device, a network access device, a bus, and the like.
所称处理器70可以是中央处理单元(Central
Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器 (Digital Signal Processor,DSP)、专用集成电路 (Application
Specific Integrated Circuit,ASIC)、现成可编程门阵列 (Field-Programmable Gate Array,FPGA) 或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The so-called processor 70 can be a central processing unit (Central
Processing Unit, CPU), can also be other general-purpose processors, digital signal processors (DSP), application specific integrated circuits (Application
Specific Integrated Circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
所述存储器71可以是所述服务器7的内部存储单元,例如服务器7的硬盘或内存。所述存储器71也可以是所述服务器7的外部存储设备,例如所述服务器7上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器71还可以既包括所述服务器7的内部存储单元也包括外部存储设备。所述存储器71用于存储所述计算机可读指令以及所述服务器所需的其他程序和数据。所述存储器71还可以用于暂时地存储已经输出或者将要输出的数据。The memory 71 may be an internal storage unit of the server 7, such as a hard disk or a memory of the server 7. The memory 71 may also be an external storage device of the server 7, such as a plug-in hard disk equipped with the server 7, a smart memory card (SMC), and a Secure Digital (SD) card. Flash card, etc. Further, the memory 71 may also include both an internal storage unit of the server 7 and an external storage device. The memory 71 is configured to store the computer readable instructions and other programs and data required by the server. The memory 71 can also be used to temporarily store data that has been output or is about to be output.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application, in essence or the contribution to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .
以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。The above embodiments are only used to explain the technical solutions of the present application, and are not limited thereto; although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still The technical solutions described in the embodiments are modified, or the equivalents of the technical features are replaced by the equivalents. The modifications and substitutions of the embodiments do not depart from the spirit and scope of the technical solutions of the embodiments of the present application.
Claims (20)
- 一种用户特征数据的获取方法,其特征在于,包括:A method for acquiring user feature data, comprising:将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronize user-based data in each business database to a distributed file system to build a user attribute library;利用计算引擎对已同步的用户基础数据进行分析汇总,得到与各个用户分别对应的特征数据;The calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained;基于所述用户属性库的接口,获取业务系统发出的数据访问请求;Obtaining a data access request sent by the service system based on the interface of the user attribute library;对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统。Verifying the validity of the data access request, and if the data access request is legal, returning the feature data of the user that matches the data access request to the service system.
- 如权利要求1所述的用户特征数据的获取方法,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统,包括:The method for acquiring user feature data according to claim 1, wherein the verifying the legality of the data access request, and if the access request is legal, the user matching the data access request The feature data is returned to the business system, including:判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中;Determining whether the IP address carried by the data access request and the access key are present in a configuration file of the user attribute database;若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括数据请求参数以及已加密的访问者签名;And if the IP address and the access key carried in the data access request are both in the configuration file of the user attribute database, acquiring attribute information in the data access request, where the attribute information includes a data request parameter and Encrypted visitor signature;对所述访问者签名进行解密处理,判断解密后的所述访问者签名与所述配置文件中的签名是否相同,并判断所述数据请求参数是否满足预设条件;Decrypting the visitor signature, determining whether the decrypted signature of the visitor is the same as the signature in the configuration file, and determining whether the data request parameter satisfies a preset condition;若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则确定所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, determining that the data access request is legal and matching the data access request The feature data is returned to the business system.
- 如权利要求1所述的用户特征数据的获取方法,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The method for acquiring user feature data according to claim 1, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute database comprises:基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Based on the JOB method, the user basic data in each service database is synchronized to the distributed file system at a preset time interval to construct a user attribute library;在所述利用计算引擎对已同步的所述用户基础数据进行分析汇总,得到各个用户分别对应的特征数据的步骤之后,还包括:After the step of analyzing and summarizing the synchronized user basic data by using the calculation engine to obtain the feature data corresponding to each user, the method further includes:将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。Saving feature data corresponding to each user to a temporary table of the user attribute library, and deleting the stored in the temporary table before synchronizing the user basic data in each service database to the user attribute library next time The feature data.
- 如权利要求1所述的用户特征数据的获取方法,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统,包括:The method for acquiring user feature data according to claim 1, wherein the verifying the legality of the data access request, and if the data access request is legal, matching the data access request Returning the feature data to the business system, including:获取所述数据访问请求的IP地址;Obtaining an IP address of the data access request;获取预设时长内基于所述IP地址的历史数据访问请求的总数;Obtaining a total number of historical data access requests based on the IP address within a preset duration;若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证;And if the total number of historical data access requests based on the IP address in the preset duration is greater than a preset threshold, the data access request is cached, and the validity of the cached data access request is determined after a preset delay authenticating;若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the data access request is legal, the feature data matching the data access request is returned to the service system.
- 如权利要求1所述的用户特征数据的获取方法,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The method for acquiring user feature data according to claim 1, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute database comprises:在各个业务数据库中,将预设时间段内的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronizing user basic data in a preset time period to a distributed file system in each business database to construct a user attribute database;其中,所述预设时间段为当月第一天至当前时刻的前一天。The preset time period is the first day of the current month to the day before the current time.
- 一种用户特征数据的获取装置,其特征在于,包括:A device for acquiring user feature data, comprising:同步模块,用于将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;a synchronization module, configured to synchronize user basic data in each business database to a distributed file system to construct a user attribute library;汇总模块,用于利用计算引擎对已同步的所述用户基础数据进行分析汇总,得到各个用户分别对应的特征数据;a summary module, configured to analyze and summarize the synchronized user basic data by using a calculation engine, to obtain feature data corresponding to each user;获取模块,用于基于所述用户属性库的接口,获取业务系统发出的数据访问请求;An obtaining module, configured to acquire, according to an interface of the user attribute library, a data access request sent by a service system;返回模块,用于对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。And returning a module, configured to verify validity of the data access request, and if the data access request is legal, return the feature data that matches the data access request to the service system.
- 根据权利要求6所述的用户特征数据的获取装置,其特征在于,所述返回模块包括:The device for acquiring user feature data according to claim 6, wherein the returning module comprises:判断子模块,用于判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中;a determining sub-module, configured to determine whether an IP address carried by the data access request and an access key are present in a configuration file of the user attribute database;第一获取子模块,用于若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括访问者签名以及数据请求参数;a first obtaining submodule, configured to acquire attribute information in the data access request, if the IP address and the access key carried by the data access request are both in a configuration file of the user attribute database, The attribute information includes a visitor signature and a data request parameter;第一返回子模块,用于若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。a first returning submodule, configured to: if the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, the data access request is legal, and the The feature data matched by the data access request is returned to the business system.
- 根据权利要求6所述的用户特征数据的获取装置,其特征在于,所述同步模块包括:The device for acquiring user feature data according to claim 6, wherein the synchronization module comprises:第一同步子模块,用于基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;a first synchronization submodule, configured to synchronize user basic data in each service database to a distributed file system at a preset time interval based on a JOB manner to construct a user attribute database;所述用户特征数据的获取装置还包括:The device for acquiring user feature data further includes:存储模块,用于将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。a storage module, configured to save feature data corresponding to each user to a temporary table of the user attribute library, and delete the user basic data in each service database to the user attribute database next time The feature data already stored in the temporary table.
- 根据权利要求6所述的用户特征数据的获取装置,其特征在于,所述返回模块包括:The device for acquiring user feature data according to claim 6, wherein the returning module comprises:第二获取子模块,用于获取所述数据访问请求的IP地址;a second obtaining submodule, configured to obtain an IP address of the data access request;第三获取子模块,用于获取预设时长内基于所述IP地址的历史数据访问请求的总数;a third obtaining submodule, configured to obtain a total number of historical data access requests based on the IP address within a preset duration;缓存子模块,用于若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证;a cache submodule, configured to cache the data access request if the total number of historical data access requests based on the IP address is greater than a preset threshold within a preset duration, and to cache the Verify the legality of the data access request;第二返回子模块,用于若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。And a second returning submodule, configured to return the feature data that matches the data access request to the service system if the data access request is legal.
- 根据权利要求6任一项所述的用户特征数据的获取装置,其特征在于,所述同步模块包括:The device for acquiring user feature data according to any one of claims 6 to 3, wherein the synchronization module comprises:第二同步子模块,用于在各个业务数据库中,将预设时间段内的用户基础数据同步至分布式文件系统,以构建用户属性库。The second synchronization sub-module is configured to synchronize user basic data in a preset time period to a distributed file system in each service database to construct a user attribute library.其中,所述预设时间段为当月第一天至当前时刻的前一天。The preset time period is the first day of the current month to the day before the current time.
- 一种服务器,其特征在于,包括存储器以及处理器,所述存储器中存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A server, comprising: a memory and a processor, wherein the memory stores computer readable instructions executable on the processor, the processor executing the computer readable instructions to:将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronize user-based data in each business database to a distributed file system to build a user attribute library;利用计算引擎对已同步的用户基础数据进行分析汇总,得到与各个用户分别对应的特征数据;The calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained;基于所述用户属性库的接口,获取业务系统发出的数据访问请求;Obtaining a data access request sent by the service system based on the interface of the user attribute library;对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统。Verifying the validity of the data access request, and if the data access request is legal, returning the feature data of the user that matches the data access request to the service system.
- 根据权利要求11所述的服务器,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统,包括:The server according to claim 11, wherein said verifying validity of said data access request, and if said access request is legal, returning feature data of a user matching said data access request to The business system includes:判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中;Determining whether the IP address carried by the data access request and the access key are present in a configuration file of the user attribute database;若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括数据请求参数以及已加密的访问者签名;And if the IP address and the access key carried in the data access request are both in the configuration file of the user attribute database, acquiring attribute information in the data access request, where the attribute information includes a data request parameter and Encrypted visitor signature;对所述访问者签名进行解密处理,判断解密后的所述访问者签名与所述配置文件中的签名是否相同,并判断所述数据请求参数是否满足预设条件;Decrypting the visitor signature, determining whether the decrypted signature of the visitor is the same as the signature in the configuration file, and determining whether the data request parameter satisfies a preset condition;若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则确定所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, determining that the data access request is legal and matching the data access request The feature data is returned to the business system.
- 根据权利要求11所述的服务器,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The server according to claim 11, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute database comprises:基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Based on the JOB method, the user basic data in each service database is synchronized to the distributed file system at a preset time interval to construct a user attribute library;在所述利用计算引擎对已同步的所述用户基础数据进行分析汇总,得到各个用户分别对应的特征数据的步骤之后,还包括:After the step of analyzing and summarizing the synchronized user basic data by using the calculation engine to obtain the feature data corresponding to each user, the method further includes:将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。Saving feature data corresponding to each user to a temporary table of the user attribute library, and deleting the stored in the temporary table before synchronizing the user basic data in each service database to the user attribute library next time The feature data.
- 根据权利要求11所述的服务器,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统,包括:The server according to claim 11, wherein said verifying validity of said data access request, and returning said feature data matching said data access request if said data access request is legal To the business system, including:获取所述数据访问请求的IP地址;Obtaining an IP address of the data access request;获取预设时长内基于所述IP地址的历史数据访问请求的总数;Obtaining a total number of historical data access requests based on the IP address within a preset duration;若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证;And if the total number of historical data access requests based on the IP address in the preset duration is greater than a preset threshold, the data access request is cached, and the validity of the cached data access request is determined after a preset delay authenticating;若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the data access request is legal, the feature data matching the data access request is returned to the service system.
- 根据权利要求11所述的服务器,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The server according to claim 11, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute database comprises:在各个业务数据库中,将预设时间段内的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronizing user basic data in a preset time period to a distributed file system in each business database to construct a user attribute database;其中,所述预设时间段为当月第一天至当前时刻的前一天。The preset time period is the first day of the current month to the day before the current time.
- 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被至少一个处理器执行时实现如下步骤:A computer readable storage medium storing computer readable instructions, wherein the computer readable instructions, when executed by at least one processor, implement the following steps:将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronize user-based data in each business database to a distributed file system to build a user attribute library;利用计算引擎对已同步的用户基础数据进行分析汇总,得到与各个用户分别对应的特征数据;The calculation engine is used to analyze and summarize the synchronized user basic data, and the feature data corresponding to each user is obtained;基于所述用户属性库的接口,获取业务系统发出的数据访问请求;Obtaining a data access request sent by the service system based on the interface of the user attribute library;对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统。Verifying the validity of the data access request, and if the data access request is legal, returning the feature data of the user that matches the data access request to the service system.
- 根据权利要求16所述的计算机可读存储介质,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述访问请求合法,则将与所述数据访问请求匹配的用户的特征数据返回至所述业务系统,包括:The computer readable storage medium according to claim 16, wherein said verifying validity of said data access request, and if said access request is legitimate, matching said user of said data access request Feature data is returned to the business system, including:判断所述数据访问请求所携带的IP地址以及访问密钥是否均存在于所述用户属性库的配置文件中;Determining whether the IP address carried by the data access request and the access key are present in a configuration file of the user attribute database;若所述数据访问请求所携带的IP地址以及访问密钥均存在于所述用户属性库的配置文件中,则获取所述数据访问请求中的属性信息,所述属性信息包括数据请求参数以及已加密的访问者签名;And if the IP address and the access key carried in the data access request are both in the configuration file of the user attribute database, acquiring attribute information in the data access request, where the attribute information includes a data request parameter and Encrypted visitor signature;对所述访问者签名进行解密处理,判断解密后的所述访问者签名与所述配置文件中的签名是否相同,并判断所述数据请求参数是否满足预设条件;Decrypting the visitor signature, determining whether the decrypted signature of the visitor is the same as the signature in the configuration file, and determining whether the data request parameter satisfies a preset condition;若解密后的所述访问者签名与所述配置文件中的签名相同,且所述数据请求参数满足预设条件,则确定所述数据访问请求合法,并将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the decrypted signature of the visitor is the same as the signature in the configuration file, and the data request parameter satisfies a preset condition, determining that the data access request is legal and matching the data access request The feature data is returned to the business system.
- 根据权利要求16所述的计算机可读存储介质,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The computer readable storage medium according to claim 16, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute library comprises:基于JOB方式,以预设的时间间隔将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库;Based on the JOB method, the user basic data in each service database is synchronized to the distributed file system at a preset time interval to construct a user attribute library;在所述利用计算引擎对已同步的所述用户基础数据进行分析汇总,得到各个用户分别对应的特征数据的步骤之后,还包括:After the step of analyzing and summarizing the synchronized user basic data by using the calculation engine to obtain the feature data corresponding to each user, the method further includes:将所述各个用户分别对应的特征数据保存至所述用户属性库的临时表,并在下一次将各个业务数据库中的用户基础数据同步至所述用户属性库之前,删除所述临时表中已存储的所述特征数据。Saving feature data corresponding to each user to a temporary table of the user attribute library, and deleting the stored in the temporary table before synchronizing the user basic data in each service database to the user attribute library next time The feature data.
- 根据权利要求16所述的计算机可读存储介质,其特征在于,所述对所述数据访问请求的合法性进行验证,若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统,包括:The computer readable storage medium according to claim 16, wherein said verifying the legality of said data access request, and if said data access request is legitimate, matching said data access request Returning the feature data to the business system, including:获取所述数据访问请求的IP地址;Obtaining an IP address of the data access request;获取预设时长内基于所述IP地址的历史数据访问请求的总数;Obtaining a total number of historical data access requests based on the IP address within a preset duration;若预设时长内基于所述IP地址的历史数据访问请求的总数大于预设阈值,则对所述数据访问请求进行缓存,并在预设时延后对缓存的所述数据访问请求的合法性进行验证;And if the total number of historical data access requests based on the IP address in the preset duration is greater than a preset threshold, the data access request is cached, and the validity of the cached data access request is determined after a preset delay authenticating;若所述数据访问请求合法,则将与所述数据访问请求匹配的所述特征数据返回至所述业务系统。If the data access request is legal, the feature data matching the data access request is returned to the service system.
- 根据权利要求16所述的计算机可读存储介质,其特征在于,所述将各个业务数据库中的用户基础数据同步至分布式文件系统,以构建用户属性库,包括:The computer readable storage medium according to claim 16, wherein the synchronizing user basic data in each service database to a distributed file system to construct a user attribute library comprises:在各个业务数据库中,将预设时间段内的用户基础数据同步至分布式文件系统,以构建用户属性库;Synchronizing user basic data in a preset time period to a distributed file system in each business database to construct a user attribute database;其中,所述预设时间段为当月第一天至当前时刻的前一天。The preset time period is the first day of the current month to the day before the current time.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710282779.4A CN107798037A (en) | 2017-04-26 | 2017-04-26 | The acquisition methods and server of user characteristic data |
CN201710282779.4 | 2017-04-26 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018196650A1 true WO2018196650A1 (en) | 2018-11-01 |
Family
ID=61531037
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/083298 WO2018196650A1 (en) | 2017-04-26 | 2018-04-17 | User feature data acquisition method and device, server, and medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107798037A (en) |
WO (1) | WO2018196650A1 (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107798037A (en) * | 2017-04-26 | 2018-03-13 | 平安科技(深圳)有限公司 | The acquisition methods and server of user characteristic data |
CN108959337A (en) * | 2018-03-22 | 2018-12-07 | 中国平安人寿保险股份有限公司 | Big data acquisition methods, device, equipment and storage medium |
CN109375913B (en) * | 2018-09-11 | 2022-04-08 | 中铁程科技有限责任公司 | Data processing method and device |
CN110990466B (en) * | 2018-09-29 | 2023-04-07 | 北京国双科技有限公司 | Data synchronization method and device |
CN111367782B (en) * | 2018-12-25 | 2023-08-15 | 中国移动通信集团浙江有限公司 | Method and device for automatically generating regression test data |
CN110336791B (en) * | 2019-05-29 | 2022-02-22 | 平安科技(深圳)有限公司 | Method, device and equipment for transmitting breakpoint data and computer storage medium |
CN110598993B (en) * | 2019-08-19 | 2023-04-18 | 深圳市鹏海运电子数据交换有限公司 | Data processing method and device |
CN110515967B (en) * | 2019-08-30 | 2020-09-08 | 望海康信(北京)科技股份公司 | Spark calculation framework-based data analysis method and electronic equipment |
CN111813405A (en) * | 2020-06-29 | 2020-10-23 | 四川长虹电器股份有限公司 | Method for rapidly updating and publishing target data interface |
CN112364021B (en) * | 2020-11-10 | 2023-10-13 | 中国平安人寿保险股份有限公司 | Service data processing method, device and storage medium |
CN113987324B (en) * | 2021-10-21 | 2024-11-01 | 北京达佳互联信息技术有限公司 | Data processing method, device, equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103034650A (en) * | 2011-09-29 | 2013-04-10 | 北京新媒传信科技有限公司 | System and method for processing data |
CN103136335A (en) * | 2013-01-31 | 2013-06-05 | 北京千分点信息科技有限公司 | Data control method based on data platforms |
US20150332337A1 (en) * | 2014-05-14 | 2015-11-19 | Korea Electronics Technology Institute | Technology of assisting context based service |
CN105512336A (en) * | 2015-12-29 | 2016-04-20 | 中国建设银行股份有限公司 | Method and device for mass data processing based on Hadoop |
CN106547766A (en) * | 2015-09-18 | 2017-03-29 | 华为技术有限公司 | A kind of data access method and device |
CN107798037A (en) * | 2017-04-26 | 2018-03-13 | 平安科技(深圳)有限公司 | The acquisition methods and server of user characteristic data |
-
2017
- 2017-04-26 CN CN201710282779.4A patent/CN107798037A/en active Pending
-
2018
- 2018-04-17 WO PCT/CN2018/083298 patent/WO2018196650A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103034650A (en) * | 2011-09-29 | 2013-04-10 | 北京新媒传信科技有限公司 | System and method for processing data |
CN103136335A (en) * | 2013-01-31 | 2013-06-05 | 北京千分点信息科技有限公司 | Data control method based on data platforms |
US20150332337A1 (en) * | 2014-05-14 | 2015-11-19 | Korea Electronics Technology Institute | Technology of assisting context based service |
CN106547766A (en) * | 2015-09-18 | 2017-03-29 | 华为技术有限公司 | A kind of data access method and device |
CN105512336A (en) * | 2015-12-29 | 2016-04-20 | 中国建设银行股份有限公司 | Method and device for mass data processing based on Hadoop |
CN107798037A (en) * | 2017-04-26 | 2018-03-13 | 平安科技(深圳)有限公司 | The acquisition methods and server of user characteristic data |
Also Published As
Publication number | Publication date |
---|---|
CN107798037A (en) | 2018-03-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018196650A1 (en) | User feature data acquisition method and device, server, and medium | |
CN110543464B (en) | Big data platform applied to intelligent park and operation method | |
CN106713332B (en) | Network data processing method, device and system | |
JP6730520B2 (en) | Immutable database supported by a cryptographically protected ledger | |
Bates et al. | Towards secure provenance-based access control in cloud environments | |
CN109643358B (en) | Cross-tenant data leak isolation | |
RU2475988C2 (en) | Method and system to use local cash supported with host node and cryptographic hash functions in order to reduce network traffic | |
CN114625767B (en) | A data query method, device, equipment and readable medium | |
US11809397B1 (en) | Managing slot requests for query execution in hybrid cloud deployments | |
TW202013234A (en) | Data processing method, device and storage medium | |
WO2021068488A1 (en) | Blockchain-based log processing method and apparatus, computer device, and storage medium | |
Essa et al. | IFHDS: intelligent framework for securing healthcare bigdata | |
KR102207072B1 (en) | System for auditing data access based on block chain and the method thereof | |
US20060272008A1 (en) | Method and security system for indentifying and blocking web attacks by enforcing read-only parameters | |
US11487865B2 (en) | Method and system for SSIS password management integration | |
CN110990335B (en) | Log archiving method, device, equipment and computer-readable storage medium | |
US11757837B2 (en) | Sensitive data identification in real time for data streaming | |
CN111241104A (en) | Operation auditing method and device, electronic equipment and computer-readable storage medium | |
US11297105B2 (en) | Dynamically determining a trust level of an end-to-end link | |
US20150358342A1 (en) | Malicious website access method and apparatus | |
CN114598749A (en) | Service access method and device | |
Dinh et al. | City on the sky: extending xacml for flexible, secure data sharing on the cloud | |
WO2025152628A1 (en) | Application programming interface asset management method and apparatus, and electronic device | |
US11899685B1 (en) | Dividing authorization between a control plane and a data plane for sharing database data | |
CN112511515B (en) | Chain number cube for data chaining |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18790069 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20.01.2020) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18790069 Country of ref document: EP Kind code of ref document: A1 |